Abstract: Let $\mathrm{BS}_1,\dots,\mathrm{BS}_n$ be independent identically distributed random variables each having the standardized Bernoulli distribution with parameter $p\in(0,1)$. Let $m_*(p):=(1+p+2p^2)/(2\sqrt{p-p^2}+4p^2)$ if $0<p\le\frac12$ and $m_*(p):=1$ if $\frac12\le p<1$. Let $m\ge m_*(p)$. Let $f$ be such a function that $f$ and $f''$ are nondecreasing and convex. Then it is proved that for all nonnegative numbers $c_1,\dots,c_n$ one has the inequality

$$\mathsf{E}f(c_1\mathrm{BS}_1+\dots+c_n\mathrm{BS}_n)\le\mathsf{E}f\bigl(s^{(m)}\cdot(\mathrm{BS}_1+\dots+\mathrm{BS}_n)\bigr),$$

where $s^{(m)}:=\bigl(\frac1n\sum_{i=1}^n c_i^{2m}\bigr)^{\frac1{2m}}$. The lower bound $m_*(p)$ on $m$ is exact for each $p\in(0,1)$. Moreover, $\mathsf{E}f(c_1\mathrm{BS}_1+\dots+c_n\mathrm{BS}_n)$ is Schur-concave in $(c_1^{2m},\dots,c_n^{2m})$. A number of related results are presented, including ones for the "symmetric" case.

A number of corollaries are obtained, including upper bounds on gener- 
alized moments and tail probabilities of (super) martingales with differences 
of bounded asymmetry, and also upper bounds on the maximal function of 
such (super)martingales. It is shown that these results may be important 
in certain statistical applications. 
1. Introduction

Let $\varepsilon_1,\dots,\varepsilon_n$ be independent Rademacher random variables (r.v.'s), so that $\mathsf{P}(\varepsilon_i=1)=\mathsf{P}(\varepsilon_i=-1)=\frac12$ for all $i$. Let $Z\sim N(0,1)$. Let $a_1,\dots,a_n$ be any real numbers such that

$$a_1^2+\dots+a_n^2=1.$$

The sharp form,

$$\mathsf{E}f(\varepsilon_1a_1+\dots+\varepsilon_na_n)\le\mathsf{E}f(Z),\tag{1.1}$$

of Khinchin's inequality for $f(x)\equiv|x|^p$ was proved by Whittle (1960) [37] for $p\ge3$ and Haagerup (1982) for $p\ge2$.

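For $f(x)=e^{\lambda x}$ ($\lambda\ge0$), inequality (1.1) follows from Hoeffding (1963) [18], whence

$$\mathsf{P}(\varepsilon_1a_1+\dots+\varepsilon_na_n\ge x)\le\inf_{\lambda\ge0}\frac{\mathsf{E}e^{\lambda Z}}{e^{\lambda x}}=e^{-x^2/2}\quad\forall x\ge0.\tag{1.2}$$

Since $\mathsf{P}(Z\ge x)\sim\frac1{x\sqrt{2\pi}}\,e^{-x^2/2}$ as $x\to\infty$, a factor $\frac1x$ is "missing" here. The apparent cause of this deficiency is that the class of exponential moment functions $f(x)=e^{\lambda x}$ ($\lambda\ge0$) is too small (and so is the class of the power functions $f(x)=|x|^p$).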
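The size of the "missing" factor can be seen numerically. The following minimal sketch (the function names are illustrative and not part of the paper) compares the exponential bound $e^{-x^2/2}$ of (1.2) with the Gaussian tail $\mathsf{P}(Z\ge x)$; the ratio should grow roughly like $x\sqrt{2\pi}$.

```python
import math

def gauss_tail(x):
    """P(Z >= x) for a standard normal Z, via the complementary error function."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))

def exponential_bound(x):
    """Right-hand side exp(-x^2 / 2) of (1.2)."""
    return math.exp(-x * x / 2.0)

def missing_factor(x):
    """Ratio of the exponential bound to the Gaussian tail."""
    return exponential_bound(x) / gauss_tail(x)

# Print the ratio next to its asymptotic equivalent x * sqrt(2 * pi).
for x in (1.0, 2.0, 4.0, 8.0):
    print(x, missing_factor(x), x * math.sqrt(2.0 * math.pi))
```

The ratio is unbounded in $x$, which is why no constant factor can repair (1.2) into a bound of the form $c\,\mathsf{P}(Z\ge x)$ without enlarging the class of moment functions.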

For all $\alpha\ge0$, consider the following much richer classes of functions $f\colon\mathbb{R}\to\mathbb{R}$:

$$\mathcal{H}_+^\alpha:=\Bigl\{f\colon f(u)=\int_{-\infty}^{\infty}(u-t)_+^\alpha\,\mu(dt)\ \ \forall u\in\mathbb{R}\Bigr\},\tag{1.3}$$

where $\mu\ge0$ is a Borel measure, $x_+:=\max(0,x)$ and $x_+^\alpha:=(x_+)^\alpha$ for $x\in\mathbb{R}$, $0^0:=0$; however, the subscript $+$ will have a different meaning when used with functions or classes of functions (as, for example, in the symbol $\mathcal{H}_+^\alpha$).
It is easy to see Proposition l(ii)] that 

$$0\le\beta<\alpha\quad\text{implies}\quad\mathcal{H}_+^\alpha\subseteq\mathcal{H}_+^\beta.\tag{1.4}$$

Proposition 1.1. For natural $\alpha$, one has $f\in\mathcal{H}_+^\alpha$ if and only if $f$ has finite derivatives $f^{(0)}:=f$, $f^{(1)}:=f'$, ..., $f^{(\alpha-1)}$ on $\mathbb{R}$ such that $f^{(\alpha-1)}$ is convex on $\mathbb{R}$ and $f^{(j)}(-\infty+)=0$ for $j=0,1,\dots,\alpha-1$.

It follows from Proposition 1.1 that, for every $t\in\mathbb{R}$, every $\beta\ge\alpha$, and every $\lambda>0$, the functions $u\mapsto(u-t)_+^\beta$ and $u\mapsto e^{\lambda(u-t)}$ belong to $\mathcal{H}_+^\alpha$.

Eaton (1970) [12] proved the Khinchin-Whittle-Haagerup inequality (1.1) for a class of moment functions, which essentially coincides with the class $\mathcal{H}_+^3$; see [27, Proposition A.1]. Based on asymptotics, numerics, and a certain related inequality, Eaton (1974) [13] conjectured that the mentioned moment comparison inequality of his implies that

$$\mathsf{P}(\varepsilon_1a_1+\dots+\varepsilon_na_n\ge x)\le\frac{2e^3}{9}\,\frac1{x\sqrt{2\pi}}\,e^{-x^2/2}\quad\forall x>\sqrt2.$$

Pinelis (1994) [27] proved the following improvement of this conjecture:

$$\mathsf{P}(\varepsilon_1a_1+\dots+\varepsilon_na_n\ge x)\le\frac{2e^3}{9}\,\mathsf{P}(Z\ge x)\quad\forall x\in\mathbb{R}.\tag{1.5}$$

It was realized in Pinelis (1998) [28] that the reason why it is possible to extract tail comparison inequality (1.5) from the Khinchin-Eaton moment comparison inequality (1.1) for $f\in\mathcal{H}_+^3$ is that the tail function $x\mapsto\mathsf{P}(Z\ge x)$ is log-concave. This realization resulted in a general device, which allows one to extract an optimal tail comparison inequality from an appropriate moment comparison inequality. The following is a special case of Theorem 4 of Pinelis (1999); see also Theorem 3.11 of Pinelis (1998) [28].

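Theorem 1.2. Suppose that $0\le\beta\le\alpha$, $\xi$ and $\eta$ are real-valued r.v.'s, and the tail function $u\mapsto\mathsf{P}(\eta\ge u)$ is log-concave on $\mathbb{R}$. Then the comparison inequality

$$\mathsf{E}f(\xi)\le\mathsf{E}f(\eta)\quad\text{for all } f\in\mathcal{H}_+^\alpha\tag{1.6}$$

implies

$$\mathsf{E}f(\xi)\le c_{\alpha,\beta}\,\mathsf{E}f(\eta)\quad\text{for all } f\in\mathcal{H}_+^\beta\tag{1.7}$$

and, in particular, for all real $x$,

$$\mathsf{P}(\xi\ge x)\le\inf_{f\in\mathcal{H}_+^\alpha}\frac{\mathsf{E}f(\eta)}{f(x)}\tag{1.8}$$

$$=B_{\mathrm{opt}}(x):=\inf_{t\in(-\infty,x)}\frac{\mathsf{E}(\eta-t)_+^\alpha}{(x-t)^\alpha}\tag{1.9}$$

$$\le\min\Bigl(c_{\alpha,0}\,\mathsf{P}(\eta\ge x),\ \inf_{h\ge0}e^{-hx}\,\mathsf{E}e^{h\eta}\Bigr),\tag{1.10}$$

where

$$c_{\alpha,\beta}:=\frac{\Gamma(\alpha+1)(e/\alpha)^\alpha}{\Gamma(\beta+1)(e/\beta)^\beta}\tag{1.11}$$

for $\beta>0$; $c_{\alpha,0}:=\Gamma(\alpha+1)(e/\alpha)^\alpha$. Moreover, the constant $c_{\alpha,\beta}$ is the best possible in (1.7) and (1.10).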

A similar result for the case when $\alpha=1$ and $\beta=0$ is contained in the book by Shorack and Wellner (1986), pages 797-799.

Note that $c_{\alpha,0}\sim\sqrt{2\pi\alpha}$ as $\alpha\to\infty$ and $c_{\alpha,\beta}\sim\sqrt{\alpha/\beta}$ as $\alpha,\beta\to\infty$.
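The constants $c_{\alpha,\beta}$ of (1.11) are easy to evaluate. The sketch below (the helper name `c` is illustrative) works on the log scale to avoid overflow and reproduces the values $c_{3,0}=2e^3/9\approx4.46$, $c_{5,0}\approx5.699$, and $c_{2,0}=e^2/2\approx3.694$ quoted in this section, as well as the Stirling-type asymptotics just noted.

```python
import math

def c(alpha, beta=0.0):
    """Constant c_{alpha,beta} of (1.11), with c_{alpha,0} = Gamma(alpha+1) (e/alpha)^alpha."""
    def g(a):
        # Gamma(a + 1) * (e / a)^a, computed via lgamma; the factor is 1 for a = 0.
        if a == 0:
            return 1.0
        return math.exp(math.lgamma(a + 1.0) + a * (1.0 - math.log(a)))
    return g(alpha) / g(beta)

print(c(3), 2 * math.e ** 3 / 9)           # constant factor in (1.5)
print(c(5), c(2))                          # constants used with alpha = 5 and alpha = 2
print(c(100) / math.sqrt(2 * math.pi * 100))
print(c(200, 100) / math.sqrt(200 / 100))
```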
Remark 1.3. As follows from [28, Remark 3.13], a useful point is that the requirement of the log-concavity of the tail function $q(u):=\mathsf{P}(\eta\ge u)$ in Theorem 1.2 can be relaxed by replacing $q$ with any (e.g., the least) log-concave majorant of $q$. However, then the optimality of $c_{\alpha,\beta}$ is not guaranteed.
Detailed studies of various cases and aspects of the optimal bound $B_{\mathrm{opt}}(x)$ in (1.9)
were presented in [TTIEHIIZI- 

Note that $c_{3,0}=2e^3/9$, which is the constant factor in (1.5). Bobkov, Götze, and Houdré (BGH) (2001) obtained a simpler proof of inequality (1.5), but with a constant factor 12.0 in place of $2e^3/9\approx4.46$. The BGH method has also been modified to obtain a version of (1.5) with a constant factor 3.22, which is $\approx1.01$ times the least possible constant factor in (1.5). Edelman proposed inequality $\mathsf{P}(S_n\ge x)\le\mathsf{P}(Z\ge x-1.5/x)$ for all $x>0$, but his proof appears to have a gap. A more precise upper bound, with $\ln c_{3,0}=1.495\ldots$ in place of 1.5, was recently shown to be a rather easy corollary of (1.5).

Remark 1.4. One also has two kinds of multi-dimensional analogues of (1.1) and (1.5). One kind is represented by [23, Theorems 2.3 and 2.4]. The other kind is based on a dimensionality reduction device. Indeed, one can show (cf. the proof of Lemma 3.2 in [22]) that, for any even function $f$ in class $\mathcal{F}^3$ (which contains $\mathcal{H}_+^3$ and is defined by (2.14) below), the function $[0,\infty)\ni u\mapsto f(\sqrt u)$ is convex. Therefore, by [30, Theorem 2.1],

$$\mathsf{E}f(\|\varepsilon_1x_1+\dots+\varepsilon_nx_n\|)\le\mathsf{E}f(|Z|)\quad\text{for all even } f \text{ in } \mathcal{F}^3,\tag{1.12}$$

where $x_1,\dots,x_n$ are any non-random vectors in a Hilbert space $(H,\|\cdot\|)$ such that $\|x_1\|^2+\dots+\|x_n\|^2=1$. It follows that (1.12) holds for all functions $f$ given by $f(x):=\mathsf{E}g(\varepsilon x)$ ($x\in\mathbb{R}$), where $g\in\mathcal{H}_+^3$ and $\varepsilon$ is a Rademacher r.v. Hence, by Theorem 1.2,

$$\mathsf{P}(\|\varepsilon_1x_1+\dots+\varepsilon_nx_n\|\ge x)\le\frac{2e^3}{9}\,\mathsf{P}(|Z|\ge x)\quad\forall x\in\mathbb{R}.\tag{1.13}$$

More generally, in view of a result by Hunt [20], inequalities (1.1), (1.5), (1.12), and (1.13) hold if $\varepsilon_1,\dots,\varepsilon_n$ are replaced there by any independent zero-mean r.v.'s $\eta_1,\dots,\eta_n$ such that $|\eta_i|\le1$ almost surely (a.s.) for all $i$.

Pinelis (1999) also obtained the "discrete" improvement of (1.5):

$$\mathsf{P}(\varepsilon_1a_1+\dots+\varepsilon_na_n\ge x)\le\frac{2e^3}{9}\,\mathsf{P}\Bigl(\frac1{\sqrt n}(\varepsilon_1+\dots+\varepsilon_n)\ge x\Bigr)\tag{1.14}$$

for all values $x$ of r.v. $\frac1{\sqrt n}(\varepsilon_1+\dots+\varepsilon_n)$.

Such results can be, and have been, extended in several different directions. In what follows, let $(S_0,S_1,\dots)$ be a supermartingale relative to a nondecreasing sequence of $\sigma$-algebras $(H_{\le0},H_{\le1},\dots)$, with $S_0\le0$ a.s. and differences

$$X_i:=S_i-S_{i-1},\quad i=1,2,\dots.$$

The following normal domination statement is one of the main results of [32].

Theorem 1.5. [32] Suppose that for every $i=1,2,\dots$ there exist $H_{\le(i-1)}$-measurable r.v.'s $A_{i-1}$ and $B_{i-1}$ and a positive real number $c_i$ such that

$$-A_{i-1}\le X_i\le B_{i-1}\quad\text{and}\tag{1.15}$$

$$\tfrac12(A_{i-1}+B_{i-1})\le c_i\tag{1.16}$$
a.s. Then for all $f\in\mathcal{H}_+^5$ and all $n=1,2,\dots$

$$\mathsf{E}f(S_n)\le\mathsf{E}f(s^{(1)}\sqrt n\,Z),\tag{1.17}$$

where

$$s^{(1)}:=\sqrt{\frac{c_1^2+\dots+c_n^2}{n}}.\tag{1.18}$$

Note that inequality (1.17) for the smaller class of exponential functions in place of the class $\mathcal{H}_+^5$ is due to Hoeffding [18].

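By virtue of Theorem 1.2, one has the following corollary.

Corollary 1.6. [32] Under the conditions of Theorem 1.5, for all $\beta\in[0,5]$, all $f\in\mathcal{H}_+^\beta$, and all $n=1,2,\dots$

$$\mathsf{E}f(S_n)\le c_{5,\beta}\,\mathsf{E}f(s^{(1)}\sqrt n\,Z).\tag{1.19}$$

In particular, for all real $x$,

$$\mathsf{P}(S_n\ge x)\le\inf_{f\in\mathcal{H}_+^5}\frac{\mathsf{E}f(s^{(1)}\sqrt n\,Z)}{f(x)}\tag{1.20}$$

$$=\inf_{t\in(-\infty,x)}\frac{\mathsf{E}(s^{(1)}\sqrt n\,Z-t)_+^5}{(x-t)^5}\tag{1.21}$$

$$\le\min\Bigl(c_{5,0}\,\mathsf{P}(s^{(1)}\sqrt n\,Z\ge x),\ \inf_{h\ge0}e^{-hx}\,\mathsf{E}e^{h s^{(1)}\sqrt n\,Z}\Bigr)\tag{1.22}$$

$$=\min\Bigl(c_{5,0}\,\mathsf{P}\Bigl(Z\ge\frac{x}{s^{(1)}\sqrt n}\Bigr),\ \exp\Bigl(-\frac{x^2}{2(s^{(1)})^2n}\Bigr)\Bigr).\tag{1.23}$$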

The upper bound (1.23), but with a constant factor greater than 427 in place of $c_{5,0}=5!\,(e/5)^5=5.699\ldots$, was obtained in Bentkus (2001) for the case when $(S_i)$ is a martingale. (In this case, Bentkus was using direct methods, rather than a generalized moment comparison inequality such as (1.17).) The large value, 427, of the constant factor renders Bentkus's bound hardly usable in statistics. Indeed, the upper bound $427\,\mathsf{P}\bigl(Z\ge\frac{x}{s^{(1)}\sqrt n}\bigr)$ improves the Hoeffding bound $\exp\bigl(-\frac{x^2}{2(s^{(1)})^2n}\bigr)$ only when $\frac{x}{s^{(1)}\sqrt n}>170$, in which case (in view of (1.23)) one has $\mathsf{P}(S_n\ge x)<c_{5,0}\,\mathsf{P}(Z\ge170)<10^{-6000}$.
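The threshold 170 can be checked with a short computation. The sketch below is only illustrative: it assumes the truncated asymptotic expansion $\mathsf{P}(Z\ge u)\approx\varphi(u)\,(1-u^{-2}+3u^{-4})/u$ of the Gaussian tail (accurate for large $u$, where direct evaluation underflows), and the function names are not from the paper.

```python
import math

def mills_ratio_asymptotic(u):
    """Approximate P(Z >= u) / phi(u) for large u by a truncated asymptotic series."""
    return (1.0 - 1.0 / u ** 2 + 3.0 / u ** 4) / u

def log10_gauss_tail(u):
    """log10 of P(Z >= u), computed on the log scale to avoid underflow."""
    log_phi = -0.5 * u * u - 0.5 * math.log(2.0 * math.pi)
    return (log_phi + math.log(mills_ratio_asymptotic(u))) / math.log(10.0)

def crossover(factor, lo=10.0, hi=1000.0):
    """Point u beyond which factor * P(Z >= u) is below exp(-u^2 / 2), found by bisection."""
    # factor * P(Z >= u) < exp(-u^2/2)  <=>  factor * R(u) / sqrt(2 pi) < 1,
    # where R is the Mills ratio; the left-hand side is decreasing in u.
    def excess(u):
        return factor * mills_ratio_asymptotic(u) / math.sqrt(2.0 * math.pi) - 1.0
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if excess(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return hi

c50 = 120.0 * (math.e / 5.0) ** 5            # c_{5,0} = 5! (e/5)^5
print(crossover(427.0))                      # slightly above 170
print(math.log10(c50) + log10_gauss_tail(170.0))
```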

As shown in [32], Theorem 1.5 and Corollary 1.6 are well suited to obtaining the most precise presently known bounds for the measure concentration phenomenon in terms of separately-Lipschitz (or, equivalently, $\ell^1$-Lipschitz) functions on product spaces.

Theorem 1.5 can be further improved, as follows.

Theorem 1.7. [32] Suppose that for every $i=1,2,\dots$ there exist a positive $H_{\le(i-1)}$-measurable r.v. $B_{i-1}$ and a positive real number $s_i$ such that

$$X_i\le B_{i-1}\quad\text{and}\tag{1.24}$$

$$\frac12\Bigl(B_{i-1}+\frac{\operatorname{Var}_{i-1}X_i}{B_{i-1}}\Bigr)\le s_i\tag{1.25}$$

a.s. Here and elsewhere, we let $\operatorname{Var}_j$ stand for conditional variance given $H_{\le j}$. Then one has all the inequalities (1.17) and (1.19)-(1.23), only with $s^{(1)}$ replaced by

$$\sqrt{\frac{s_1^2+\dots+s_n^2}{n}}.\tag{1.26}$$

The set of conditions (1.24)-(1.25) is equivalent to

$$X_i\le B_{i-1}\quad\text{and}\quad\sigma_*(B_{i-1},\operatorname{Var}_{i-1}X_i)\le s_i$$



a.s., where 




for positive c and 60. This follows simply because the inequalities Xi ^ 
and & ^ Sj-i imply ^ 6. 

Thus, in the case when $\operatorname{Var}_{i-1}X_i<B_{i-1}^2$ a.s., conditions (1.24)-(1.25) represent an improvement of condition $B_{i-1}^2\vee\operatorname{Var}_{i-1}X_i\le s_i^2$ a.s., imposed in earlier work. In a certain variety of cases, this improvement may be even more significant than the improvement in the constant factor from 427 to 5.699... before the probability sign.

Moreover, it can be shown that the function $\sigma_*(\cdot,\cdot)$ of the pair $(B_{i-1},\operatorname{Var}_{i-1}X_i)$, which is in effect used in Theorem 1.7, is nearly optimal as far as the normal domination is concerned.

However, it can also be seen that even the best possible normal domination 
may be inadequate if the asymmetry of the random summands Xi is significant 
or if $n$ is not large. In such a case, one may try to use binomial domination instead of normal, as in the following theorem, which is a straightforward corollary of results of [30] (or [3]).

Theorem 1.8. [30] Suppose that for every $i=1,2,\dots$ there exist non-random constants $b_i>0$ and $c_i>0$ such that

$$X_i\le b_i\quad\text{and}\tag{1.27}$$

$$\operatorname{Var}_{i-1}X_i\le c_i^2\tag{1.28}$$

a.s. Then, for all $n=1,2,\dots$,

$$\mathsf{E}f(S_n)\le\mathsf{E}f(T_n)\quad\forall f\in\mathcal{H}_+^2,\quad\text{where}\tag{1.29}$$

$$T_n:=Z_1+\dots+Z_n\tag{1.30}$$

and $Z_1,\dots,Z_n$ are i.i.d. r.v.'s such that each $Z_i$ takes on only two values, one of which is

$$b:=\max_i b_i,\tag{1.31}$$

and satisfies the conditions

$$\mathsf{E}Z_i=0\quad\text{and}\quad\operatorname{Var}Z_i=c^2,\quad\text{where}\quad c^2:=\frac{c_1^2+\dots+c_n^2}{n};\tag{1.32}$$

that is, $\mathsf{P}(Z_i=b)=\dfrac{c^2}{b^2+c^2}$ and $\mathsf{P}\Bigl(Z_i=-\dfrac{c^2}{b}\Bigr)=\dfrac{b^2}{b^2+c^2}$.

Based on this result, the tail comparison inequality

$$\mathsf{P}(S_n\ge y)\le c_{2,0}\,\mathsf{P}^{\mathrm{lin,LC}}\bigl(T_n\ge y+\tfrac h2\bigr)\quad\forall y\in\mathbb{R}\tag{1.33}$$

was obtained in earlier work, where $c_{2,0}=e^2/2=3.694\ldots$ (in accordance with (1.11)), $h:=b+c^2/b$, and the function $y\mapsto\mathsf{P}^{\mathrm{lin,LC}}(T_n\ge y)$ is the least log-concave majorant of the linear interpolation of the tail function $y\mapsto\mathsf{P}(T_n\ge y)$ over the lattice of all points of the form $nb+kh$ ($k\in\mathbb{Z}$). Tail comparison inequality (1.33) is a substantial improvement of the corresponding inequality in [2, Theorem 1] and of a related result stated elsewhere as Theorem 1.1.

Yet, while the "variance"-averaging given by (1.32) is nice, the extreme kind of upper-bound-averaging (1.31) seems very crude.

In this paper, another approach to the problem of asymmetry is presented. Here we provide binomial upper bounds on generalized moments and tail probabilities for $S_n$ assuming that certain indices of asymmetry of the $X_i$'s (rather than the $X_i$'s themselves) are uniformly bounded from above. This assumption of bounded asymmetry (in contrast with the uniform boundedness) of the $X_i$'s is natural in certain statistical applications; see Subsection 3.2.

The earlier results for the symmetric case can be similarly complemented, with condition (1.31) of the uniform boundedness of the ranges now replaced by a condition of uniform boundedness of the kurtoses of the $X_i$'s; see Remark 3.6.



2. Statements of basic results and discussion 

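Let $\mathrm{BS}(p)$ denote the standardized Bernoulli distribution with parameter $p$: for a r.v. $\mathrm{BS}$ let, by definition,

$$\mathrm{BS}\sim\mathrm{BS}(p)\iff\mathsf{P}\Bigl(\mathrm{BS}=\sqrt{\tfrac qp}\Bigr)=p=1-\mathsf{P}\Bigl(\mathrm{BS}=-\sqrt{\tfrac pq}\Bigr),$$

where

$$q:=1-p\quad\text{and}\quad0<p<1;$$

thus, $\mathrm{BS}(p)$ is a two-point zero-mean unit-variance distribution. In particular, $\mathrm{BS}(\frac12)$ is the distribution of a Rademacher r.v. $\varepsilon$, with $\mathsf{P}(\varepsilon=\pm1)=\frac12$.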
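The standardization can be verified directly. A minimal sketch (with illustrative helper names) lists the two atoms of $\mathrm{BS}(p)$ and checks that the mean is 0 and the variance is 1:

```python
import math

def bs_distribution(p):
    """Atoms (value, probability) of the standardized Bernoulli distribution BS(p)."""
    q = 1.0 - p
    return [(math.sqrt(q / p), p), (-math.sqrt(p / q), q)]

def moment(dist, k):
    """k-th moment of a finitely supported distribution given as (value, probability) pairs."""
    return sum(prob * value ** k for value, prob in dist)

for p in (0.1, 0.5, 0.9):
    d = bs_distribution(p)
    print(p, moment(d, 1), moment(d, 2))     # mean 0 and second moment 1, up to rounding
```

For $p=\frac12$ the two atoms are $\pm1$, which is the Rademacher case.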

Let $C^2$ denote the class of all twice continuously differentiable functions $f\colon\mathbb{R}\to\mathbb{R}$. Consider the following class of functions:

$$\mathcal{F}_+^3:=\{f\in C^2\colon f\text{ and }f''\text{ are nondecreasing and convex}\}.\tag{2.1}$$

An equivalent definition would be given by the formula

$$\mathcal{F}_+^3=\{f\in C^2\colon f,\ f',\ f'',\text{ and }f'''\text{ are nondecreasing}\},$$

where $f'''$ denotes the right derivative of the convex function $f''$.

For example, functions $x\mapsto a+bx+c\,(x-t)_+^\alpha$ and $x\mapsto a+bx+c\,e^{\lambda x}$ belong to $\mathcal{F}_+^3$ for all $a\in\mathbb{R}$, $b\ge0$, $c\ge0$, $t\in\mathbb{R}$, $\alpha\ge3$, and $\lambda\ge0$.

Remark 2.1. If a function $f\colon\mathbb{R}\to\mathbb{R}$ is convex and a r.v. $X$ has a finite expectation, then, by Jensen's inequality, $\mathsf{E}f(X)$ always exists in $(-\infty,\infty]$. This remark will be used in this paper (sometimes tacitly) for functions $f$ in the class $\mathcal{F}_+^3$, as well as for other convex functions.

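Throughout the paper, unless indicated otherwise, the following notation/assumptions will be used:

$$m\in[1,\infty),\quad p\in(0,1),\quad q:=1-p,\quad\text{and}\quad\mathrm{BS}_1,\dots,\mathrm{BS}_n\ \overset{\text{i.i.d.}}{\sim}\ \mathrm{BS}(p).\tag{2.2}$$

Introduce also

$$m_*(p):=\begin{cases}\dfrac{1+p+2p^2}{2\bigl(\sqrt{p-p^2}+2p^2\bigr)}&\text{if }0<p\le\frac12,\\[2ex]1&\text{if }\frac12\le p<1.\end{cases}\tag{2.3}$$

Later it will be clear that $m_*(p)$ increases from 1 to $\infty$ as $p$ decreases from $\frac12$ to 0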
(see the proof of Lemma [4. 1.1 711 . 
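For orientation, definition (2.3) is straightforward to evaluate. The sketch below (the function name `m_star` is illustrative) computes $m_*(p)$ and reproduces the values 1.75, 5.06..., 15.83..., 50.00... quoted for $p=10^{-1},\dots,10^{-4}$ in Remark 3.5 below.

```python
import math

def m_star(p):
    """Exact lower bound m_*(p) of (2.3)."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie in (0, 1)")
    if p >= 0.5:
        return 1.0
    return (1.0 + p + 2.0 * p * p) / (2.0 * (math.sqrt(p - p * p) + 2.0 * p * p))

for p in (1e-1, 1e-2, 1e-3, 1e-4, 0.4999, 0.5):
    print(p, m_star(p))
```

Note that the two branches of (2.3) agree at $p=\frac12$, and that $m_*(p)\sim\frac1{2\sqrt p}$ as $p\downarrow0$.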

Of the main results of this paper, the following one is perhaps the easiest to 
state (but not to prove). 

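Theorem 2.2. For any real number

$$m\ge m_*(p),$$

all $f\in\mathcal{F}_+^3$, all natural $n$, and all nonnegative numbers $c_1,\dots,c_n$, one has

$$\mathsf{E}f(c_1\mathrm{BS}_1+\dots+c_n\mathrm{BS}_n)\le\mathsf{E}f\bigl(s^{(m)}\cdot(\mathrm{BS}_1+\dots+\mathrm{BS}_n)\bigr),\tag{2.4}$$

$$\text{where}\quad s^{(m)}:=\Bigl(\frac1n\sum_{i=1}^nc_i^{2m}\Bigr)^{\frac1{2m}}.\tag{2.5}$$

Moreover, the lower bound $m_*(p)$ on $m$ is exact for each $p\in(0,1)$.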

The proofs are deferred to a later section.
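For small $n$, inequality (2.4) can be checked exactly by enumerating all $2^n$ outcomes. The following sketch does so for $p=0.1$ and two members of $\mathcal{F}_+^3$; the weights, the test functions, and the helper names are illustrative choices only. It also shows that, for $p=0.1$, $c=(1,0)$ and $f(x)=e^x$, the inequality fails with $m=1<m_*(0.1)=1.75$, so the restriction on $m$ is not vacuous.

```python
import itertools
import math

def m_star(p):
    """Exact lower bound m_*(p) of (2.3)."""
    if p >= 0.5:
        return 1.0
    return (1.0 + p + 2.0 * p * p) / (2.0 * (math.sqrt(p - p * p) + 2.0 * p * p))

def s_power_mean(c, m):
    """s^(m) of (2.5): power mean of order 2m of the weights c_1, ..., c_n."""
    return (sum(ci ** (2.0 * m) for ci in c) / len(c)) ** (1.0 / (2.0 * m))

def expect(f, weights, p):
    """Exact E f(w_1 BS_1 + ... + w_n BS_n) for i.i.d. BS_i ~ BS(p), by enumeration."""
    q = 1.0 - p
    up, down = math.sqrt(q / p), -math.sqrt(p / q)
    total = 0.0
    for pattern in itertools.product((0, 1), repeat=len(weights)):
        prob, value = 1.0, 0.0
        for w, bit in zip(weights, pattern):
            prob *= p if bit else q
            value += w * (up if bit else down)
        total += prob * f(value)
    return total

def both_sides(f, c, p, m):
    """Left-hand and right-hand sides of (2.4)."""
    s = s_power_mean(c, m)
    return expect(f, c, p), expect(f, [s] * len(c), p)

p = 0.1
cubic = lambda x: max(x - 1.0, 0.0) ** 3     # x -> (x - 1)_+^3
print(both_sides(math.exp, [1.0, 2.0, 0.5], p, m_star(p)))
print(both_sides(cubic, [1.0, 2.0, 0.5], p, m_star(p)))
print(both_sides(math.exp, [1.0, 0.0], p, 1.0))   # m = 1 is too small here
```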

Remark 2.3. The general restriction $m\ge1$ in (2.2) is quite natural. Indeed, if inequality (2.4) held for some $m\in(0,1)$ then, taking $c_1=1$, $c_2=\dots=c_n=0$, and letting $n\to\infty$, one would have, by the central limit theorem, the inequality $\mathsf{E}f(\mathrm{BS}_1)\le f(0)$ for all $f\in\mathcal{F}_+^3$, which is false even for $f(x)=e^x$ or $f(x)=x_+^3$.

Here is a generalization of Theorem 2.2.
Theorem 2.4. Let $X_1,\dots,X_n$ be independent r.v.'s such that for every $i\in\{1,\dots,n\}$

$$\mathsf{E}X_i\le0\quad\text{and}\quad-a_i\le X_i\le b_i\quad\text{a.s.},$$

where $a_i$ and $b_i$ are positive numbers such that

$$\frac{b_i}{a_i}\le\frac qp.\tag{2.6}$$

Then, for any real number $m\ge m_*(p)$ and all $f\in\mathcal{F}_+^3$, one has the inequality

$$\mathsf{E}f(X_1+\dots+X_n)\le\mathsf{E}f\bigl(s^{(m)}\cdot(\mathrm{BS}_1+\dots+\mathrm{BS}_n)\bigr),\quad\text{where}\tag{2.7}$$

$$s^{(m)}:=\Bigl(\frac1n\sum_{i=1}^n(a_ib_i)^m\Bigr)^{\frac1{2m}}.$$

Moreover, the lower bound $m_*(p)$ on $m$ is exact for each $p\in(0,1)$.

Condition (2.6) may be referred to as a bounded-asymmetry condition.
Theorem 2.4 can be easily extended to (super)martingales.

Theorem 2.5. Suppose that for every $i\in\{1,\dots,n\}$ one has (1.15) with positive $A_{i-1}$ and $B_{i-1}$ such that

$$\sqrt{A_{i-1}B_{i-1}}\le c_i\quad\text{and}\tag{2.8}$$

$$\frac{B_{i-1}}{A_{i-1}}\le\frac qp\tag{2.9}$$

a.s., where $c_i$ is a non-random number. Then, for any real number

$$m\ge m_*(p)\tag{2.10}$$

and $f\in\mathcal{F}_+^3$, one has the inequality

$$\mathsf{E}f(S_n)\le\mathsf{E}f\bigl(s^{(m)}\cdot(\mathrm{BS}_1+\dots+\mathrm{BS}_n)\bigr),\tag{2.11}$$

where $s^{(m)}$ is defined by (2.5). Moreover, the lower bound $m_*(p)$ on $m$ is exact for each $p\in(0,1)$.

One should compare (2.8) and (2.5) with (1.16) and (1.18). If $X_{a,b}$ stands for a zero-mean r.v. taking on values in the set $\{-a,b\}$ for some positive $a$ and $b$, then obviously the half-range $\frac12(a+b)$ of $X_{a,b}$ is no less than its standard deviation $\sqrt{ab}$. That is, (1.16) is more restrictive than (2.8). On the other hand, one has the inequality $s^{(m)}\ge s^{(1)}$ for $m\ge1$. Moreover, the greater the uniform bound $\frac qp$ on asymmetry in (2.9) is, the greater $m$ must be according to (2.10) and hence the more pronounced the inequality $s^{(m)}\ge s^{(1)}$ will be. Yet, it will be demonstrated elsewhere that, overall, (2.8) and (2.5) work better in certain important statistical applications than (1.16) and (1.18). Note also that one can choose the "ideal" value $m=1$ whenever the asymmetry index $\frac qp$ does not exceed 1, that is, whenever the $X_i$'s are not skewed to the right.

Recall the definition of the Schur majorization: for $a:=(a_1,\dots,a_n)$ and $b:=(b_1,\dots,b_n)$ in $\mathbb{R}^n$, $a\succeq b$ means that $a_1+\dots+a_n=b_1+\dots+b_n$ and $a_{[1]}+\dots+a_{[j]}\ge b_{[1]}+\dots+b_{[j]}$ for all $j\in\{1,\dots,n\}$, where $a_{[1]}\ge\dots\ge a_{[n]}$ are the ordered numbers $a_1,\dots,a_n$, from the largest to the smallest. Recall also that a function $Q\colon[0,\infty)^n\to\mathbb{R}$ is referred to as Schur-concave if it reverses the Schur majorization: for any $a$ and $b$ in $[0,\infty)^n$ such that $a\succeq b$, one has $Q(a)\le Q(b)$.
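The majorization relation just recalled can be tested mechanically. The following is a minimal sketch (the function name and the tolerance parameter are illustrative): it compares the totals and then the partial sums of the decreasingly ordered entries.

```python
def majorizes(a, b, tol=1e-12):
    """Return True if a majorizes b in the Schur sense (a >= b in the majorization order)."""
    if len(a) != len(b):
        return False
    # The totals must coincide.
    if abs(sum(a) - sum(b)) > tol:
        return False
    # Partial sums of the ordered entries of a must dominate those of b.
    partial_a = partial_b = 0.0
    for x, y in zip(sorted(a, reverse=True), sorted(b, reverse=True)):
        partial_a += x
        partial_b += y
        if partial_a < partial_b - tol:
            return False
    return True

print(majorizes([3, 0, 0], [1, 1, 1]))   # the "most spread out" vector majorizes the flat one
print(majorizes([1, 1, 1], [3, 0, 0]))
```

A Schur-concave function is therefore largest at the flat vector among all vectors with a given total, which is the mechanism behind the right-hand side of (2.4).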

Theorems 2.2 and 2.5 are contained in

Theorem 2.6. The following statements are equivalent to one another.

(I) $m\ge m_*(p)$.

(II) For all $f\in\mathcal{F}_+^3$, all natural $n\ge2$, and all nonnegative numbers $c_1,\dots,c_n$, one has (2.4).

(III) The same as item (II), but only for $n=2$.

(IV) For every natural $n\ge2$ and every function $f\in\mathcal{F}_+^3$, the function

$$[0,\infty)^n\ni(a_1,\dots,a_n)\mapsto\mathsf{E}f\bigl(a_1^{1/(2m)}\mathrm{BS}_1+\dots+a_n^{1/(2m)}\mathrm{BS}_n\bigr)\tag{2.12}$$

is Schur-concave.

(V) The same as item (IV), but only for $n=2$.

(VI) Let the $X_i$'s, $a_i$'s, $b_i$'s, and $s^{(m)}$ be as in the statement of Theorem 2.4. Then one has (2.7) for all natural $n$.

(VII) Let the $X_i$'s, $A_{i-1}$'s, $B_{i-1}$'s, $c_i$'s, and $s^{(m)}$ be as in the statement of Theorem 2.5. Then one has (2.11) for all natural $n$.

The special case of statement (IV) of Theorem 2.6 with $p=\frac12$ and $m=1$ is essentially the mentioned result due to Whittle [37] and Eaton [12].

From the "right-tail" Theorem 2.6 one can deduce its left-tail and two-tail analogues. Appropriate left-tail and two-tail counterparts of $\mathcal{F}_+^3$ are the following classes of functions:

$$\mathcal{F}_-^3:=\{f\in C^2\colon f\text{ and }f''\text{ are nonincreasing and convex}\}\tag{2.13}$$

$$=\{f\colon\exists g\in\mathcal{F}_+^3\ \forall x\in\mathbb{R}\ \ f(x)=g(-x)\}\quad\text{and}$$

$$\mathcal{F}^3:=\{f\in C^2\colon f\text{ and }f''\text{ are convex}\}.\tag{2.14}$$

Theorem 2.7. Theorem 2.6 holds with $\mathcal{F}_-^3$ in place of $\mathcal{F}_+^3$ if the restrictions (i) $m\ge m_*(p)$, (ii) $\mathsf{E}X_i\le0$, (iii) $\frac{b_i}{a_i}\le\frac qp$, (iv) $(S_0,\dots,S_n)$ is a supermartingale with $S_0\le0$, and (v) $\frac{B_{i-1}}{A_{i-1}}\le\frac qp$ in Theorem 2.6 are replaced, respectively, with the following:

(i) $m\ge m_*(q)$,

(ii) $\mathsf{E}X_i\ge0$,

(iii) $\dfrac{a_i}{b_i}\le\dfrac pq$,

(iv) $(S_0,\dots,S_n)$ is a submartingale with $S_0\ge0$, and

(v) $\dfrac{A_{i-1}}{B_{i-1}}\le\dfrac pq$.

This "left-tail" analogue is a trivial corollary of Theorem 2.6.
The "two-tail" analogue of Theorem 2.6 is more difficult to prove. It relies in part on Proposition 2.9 below, preceded by the following definition.

Definition 2.8. Let us say that a sequence of functions (/„) in converges 
to a function / in and write /n — *■ / (as n oo) if fn{x) t f{x) and 
/n(^) ~^ f'i^) foi' i'6al X. (This stronger notion of convergence will make it 
easier to verify the convergence of relevant expected values; also, it naturally 
provides for the relevant classes of functions to be closed.) 

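For any subset $\mathcal{A}$ of $C^2$, its closure, denoted here by $\operatorname{cl}\mathcal{A}$, will be understood here simply as the set of the limits of all sequences in $\mathcal{A}$ that are convergent in $C^2$. Obviously, $\operatorname{cl}\mathcal{A}\supseteq\mathcal{A}$, for all $\mathcal{A}\subseteq C^2$.

Obviously, the "two-tail" class $\mathcal{F}^3$ contains both "one-tail" classes $\mathcal{F}_+^3$ and $\mathcal{F}_-^3$. The more informative relation of $\mathcal{F}^3$ to $\mathcal{F}_+^3$ and $\mathcal{F}_-^3$ (on which the proof of Theorem 2.10 below is partly based) is given by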

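Proposition 2.9. One has $\mathcal{F}^3=\operatorname{cl}\mathcal{G}^3$, where

$$\mathcal{G}^3:=\{f\in C^2\colon\exists c\ge0\ \exists f_+\in\mathcal{F}_+^3\ \exists f_-\in\mathcal{F}_-^3\ \forall x\in\mathbb{R}\ \ f(x)=cx^2/2+f_+(x)+f_-(x)\}.\tag{2.15}$$

However, $\mathcal{G}^3\ne\mathcal{F}^3$.

For example, functions $x\mapsto a+bx+cx^2+d\,|x-t|^\alpha$, $x\mapsto\cosh\lambda x$, $x\mapsto e^{\lambda x}$, $x\mapsto(x-t)_+^\alpha$, and $x\mapsto(t-x)_+^\alpha$ belong to $\mathcal{F}^3$ for all $a\in\mathbb{R}$, $b\in\mathbb{R}$, $c\ge0$, $d\ge0$, $t\in\mathbb{R}$, $\alpha\ge3$, and $\lambda\in\mathbb{R}$.

Note also that the classes $\mathcal{F}_+^3$, $\mathcal{F}_-^3$, and $\mathcal{F}^3$ are convex cones; that is, any linear combination with nonnegative coefficients of functions belonging to any one of these classes belongs to the same class.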

Theorem 2.10. Theorem 2.6 holds with $\mathcal{F}^3$ in place of $\mathcal{F}_+^3$ if the restrictions (i) $m\ge m_*(p)$, (ii) $\mathsf{E}X_i\le0$, (iii) $\frac{b_i}{a_i}\le\frac qp$, (iv) $(S_0,\dots,S_n)$ is a supermartingale with $S_0\le0$, and (v) $\frac{B_{i-1}}{A_{i-1}}\le\frac qp$ in Theorem 2.6 are replaced, respectively, with the following stronger restrictions:

(i) $m\ge m_*(p)$ and $p\le\frac12$,

(ii) $\mathsf{E}X_i=0$,

(iii) $\max\Bigl(\dfrac{b_i}{a_i},\dfrac{a_i}{b_i}\Bigr)\le\dfrac qp$,

(iv) $(S_0,\dots,S_n)$ is a martingale with $S_0=0$, and

(v) $\max\Bigl(\dfrac{B_{i-1}}{A_{i-1}},\dfrac{A_{i-1}}{B_{i-1}}\Bigr)\le\dfrac qp$.
3. Applications

3.1. Bounds on even richer classes of generalized moments 

(including tail probabilities) , maximal inequalities, and some 
further extensions 

Using Theorem 2.5 and Remark 1.3 (and also recalling definition (2.1) and Proposition 1.1), one immediately obtains the following corollary, which may be compared with Corollary 2.2 in [32].

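Corollary 3.1. Suppose that conditions (1.15), (2.8), (2.9), and (2.10) hold. Then for all $\beta\in[0,3]$, all $f\in\mathcal{H}_+^\beta$, and all $n=1,2,\dots$

$$\mathsf{E}f(S_n)\le c_{3,\beta}\,\mathsf{E}f(T_n),\tag{3.1}$$

where

$$T_n:=s^{(m)}\cdot(\mathrm{BS}_1+\dots+\mathrm{BS}_n).$$

In particular, for all real $x$,

$$\mathsf{P}(S_n\ge x)\le\inf_{f\in\mathcal{H}_+^3}\frac{\mathsf{E}f(T_n)}{f(x)}\tag{3.2}$$

$$=\inf_{t\in(-\infty,x)}\frac{\mathsf{E}(T_n-t)_+^3}{(x-t)^3}\tag{3.3}$$

$$\le\min\Bigl(c_{3,0}\,\mathsf{P}^{\mathrm{LC}}(T_n\ge x),\ \inf_{h\ge0}e^{-hx}\,\mathsf{E}e^{hT_n}\Bigr)\tag{3.4}$$

$$=\min\bigl(c_{3,0}\,\mathsf{P}^{\mathrm{LC}}(T_n\ge x),\ e^{-nH}\bigr),\tag{3.5}$$

where $x\mapsto\mathsf{P}^{\mathrm{LC}}(T_n\ge x)$ is the least log-concave majorant of the function $x\mapsto\mathsf{P}(T_n\ge x)$ on $\mathbb{R}$; $H:=(p+y)\ln\frac{p+y}{p}+(q-y)\ln\frac{q-y}{q}$ if $0\le y:=\frac{x\sqrt{pq}}{n\,s^{(m)}}<q$, $H:=-\ln p$ if $y=q$, $H:=\infty$ if $y>q$, and $H:=0$ if $y<0$.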

Note that $\mathsf{P}^{\mathrm{LC}}(T_n\ge x)=\mathsf{P}(T_n\ge x)$ for all $x$ in the lattice

$$L:=\{nb+kh\colon k\in\mathbb{Z}\}$$

generated by the support of the distribution of $T_n$, where $b:=s^{(m)}\sqrt{q/p}$ and $h:=s^{(m)}/\sqrt{pq}$.
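The passage from the exponential bound in (3.4) to $e^{-nH}$ in (3.5) can be checked numerically. The sketch below assumes the reading $H=(p+y)\ln\frac{p+y}{p}+(q-y)\ln\frac{q-y}{q}$ with $y=x\sqrt{pq}/(n\,s^{(m)})$, and minimizes $-hx+n\ln\mathsf{E}e^{h s\,\mathrm{BS}}$ over $h\ge0$ by a ternary search (an illustrative choice of method; the function is convex in $h$). The parameter values and names are hypothetical.

```python
import math

def hoeffding_H(p, y):
    """Relative-entropy exponent H as a function of y = x sqrt(pq) / (n s)."""
    q = 1.0 - p
    if y < 0.0:
        return 0.0
    if y > q:
        return math.inf
    if y == q:
        return -math.log(p)
    return (p + y) * math.log((p + y) / p) + (q - y) * math.log((q - y) / q)

def log_mgf_bs(p, h):
    """log E exp(h * BS) for BS ~ BS(p)."""
    q = 1.0 - p
    return math.log(p * math.exp(h * math.sqrt(q / p)) + q * math.exp(-h * math.sqrt(p / q)))

def log_chernoff(p, n, s, x, h_max=50.0):
    """Minimum over h in [0, h_max] of -h x + n log E exp(h s BS), by ternary search."""
    g = lambda h: -h * x + n * log_mgf_bs(p, h * s)
    lo, hi = 0.0, h_max
    for _ in range(200):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if g(m1) < g(m2):
            hi = m2
        else:
            lo = m1
    return g(0.5 * (lo + hi))

p, n, s, x = 0.3, 10, 1.0, 3.0
y = x * math.sqrt(p * (1.0 - p)) / (n * s)
print(log_chernoff(p, n, s, x), -n * hoeffding_H(p, y))   # the two values should agree
```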

The bound $e^{-nH}$ in (3.5) is largely due to Hoeffding [18].

Using also results from the literature, one has the following.

Corollary 3.2. Under the conditions of Corollary 3.1,

$$\mathsf{P}(S_n\ge x)\le c_{3,0}\,\mathsf{P}^{\mathrm{lin,LC}}\bigl(T_n\ge x+\tfrac h2\bigr)\quad\forall x\in\mathbb{R},\tag{3.6}$$

where $x\mapsto\mathsf{P}^{\mathrm{lin,LC}}(T_n\ge x)$ is the least log-concave majorant of the linear interpolation of the tail function $x\mapsto\mathsf{P}(T_n\ge x)$ over the lattice $L$.

The upper bound in (3.6) usually works better than that in (3.5) in statistical practice. An explicit formula for $\mathsf{P}^{\mathrm{lin,LC}}(T_n\ge x+\frac h2)$ is given in [31].

Corollary 3.3. In view of known results, one can replace $S_n$ in the left-hand side of inequalities (3.2) and (3.6) by

Mn '■= max Sk- 

Similarly, inequality (3.1) holds for all $\beta\in(0,3)$ with $M_n$ in place of $S_n$ if $c_{3,\beta}$
is replaced there by """'^'^ cs a, where, for (3 G (0, a), 

Ki-Qfl:= super '^^ '[ / and ka n :^ . 

Corollary 3.4. The set of conditions (1.15), (2.8), and (2.9) in the corollaries above can be replaced by the set of conditions (1.27), (1.28), and

$$\frac{b_i^2}{c_i^2}\le\frac qp\quad\forall i=1,\dots,n.\tag{3.7}$$

In fact, the two sets of conditions mentioned in Corollary 3.4 are equivalent to each other in a certain sense; see e.g. Remark 2.4 in [32] and the proof of Theorem 2.3 therein.

Note that the special case of Corollary 3.4 with $p=\frac12$ (so that one may take $m=1$) contains, among other things, Theorem 1.3 of [4], which states that, if $(S_i)$ is a martingale with $S_0=0$ satisfying conditions (1.27) with

$$b_i=c_i\quad\forall i=1,\dots,n$$

and (1.28), then $\forall x\in\mathbb{R}$

$$\mathsf{P}(S_n\ge x)\le c_{3,0}\,\mathsf{P}^{\mathrm{LC}}(T_n\ge x)\quad\text{with}\quad T_n=s^{(1)}\cdot(\varepsilon_1+\dots+\varepsilon_n);\tag{3.8}$$

by the central limit theorem, this inequality implies

$$\mathsf{P}(S_n\ge x)\le c_{3,0}\,\mathsf{P}(s^{(1)}\sqrt n\,Z\ge x).\tag{3.9}$$

Obviously, inequalities (3.9) and (3.8) are extensions of (1.5) and (1.14), respectively. A version of inequality (3.9), with the larger constant factor $1/\mathsf{P}(Z>\sqrt3)=24.01\ldots$ in place of $c_{3,0}=2e^3/9=4.46\ldots$, appeared earlier in [3]. The improvement in the constant factor achieved in [4], as compared with [3], is due to replacing the direct method used in the earlier paper with the method based on Theorem 1.2 and Remark 1.3, which allows one to extract optimal tail comparison inequalities from comparison of generalized moments.

More generally, the generalized moments $\mathsf{E}f(T_n)$ in the above upper bounds, where $T_n:=s^{(m)}\cdot(\mathrm{BS}_1+\dots+\mathrm{BS}_n)$, can be replaced by $\mathsf{E}f(s^{(1)}\sqrt n\,Z)$ provided that $p\ge\frac12$.

That $(S_0,S_1,\dots)$ is allowed to be a supermartingale (rather than only a martingale) makes it convenient to use the simple but powerful truncation tool; cf. the discussion at the end of Section 2 in [32].
Remark 3.5. In the above results, the exact lower bound $m_*(p)$ on $m$ can be replaced by a substantially smaller (for $p\in(0,\frac12)$) exact lower bound $m_{\exp}(p)$ on $m$ if the class $\mathcal{F}_+^3$ is replaced by the substantially smaller class

$$\mathcal{F}_+^\infty:=\{f\in C^\infty\colon f^{(j)}\ge0\ \ \forall j\in\{0,1,\dots\}\}$$

of completely monotone functions, where $C^\infty$ is the class of all infinitely differentiable functions $f\colon\mathbb{R}\to\mathbb{R}$ with derivatives $f^{(0)}:=f$, $f^{(1)}:=f'$, .... By Bernstein's theorem on completely monotone functions (see, e.g., [26]),

$$\mathcal{F}_+^\infty=\Bigl\{f\colon f(x)=\int_{[0,\infty)}e^{kx}\,\mu(dk)\ \ \forall x\in\mathbb{R}\Bigr\},$$

where $\mu\ge0$ is a Borel measure such that the integral $\int_{[0,\infty)}e^{kx}\,\mu(dk)$ is finite $\forall x\in\mathbb{R}$. On the other hand, by (1.4) and Proposition 1.1,

= {/i + c: /i G Pi c^O}. 

By Remark 2.3, the exact lower bound $m_{\exp}(p)$ cannot be less than 1, so that $m_{\exp}(p)=m_*(p)=1$ for all $p\in[\frac12,1)$. However, for $p\in(0,\frac12)$, $m_{\exp}(p)$ is substantially smaller than $m_*(p)$. In particular, when $p\downarrow0$, one has $m_{\exp}(p)\sim\frac12\ln\frac1p$, while $m_*(p)\sim\frac1{2\sqrt p}$. For $p\in(0,\frac12)$, the exact lower bound $m_{\exp}(p)$ can be described by parametric equations

$$m_{\exp}(p)=\tilde m(k):=\frac{k\,(1+e^k)}{2\,(e^k-1)},\qquad p=p(k):=\frac{e^k-1-k}{(e^k-1)\bigl(1+k+(k-1)e^k\bigr)},\tag{3.10}$$

with $k\in(0,\infty)$ as the parameter. One can see that $p(k)$ decreases from $\frac12$ to 0 as $k$ increases from 0 to $\infty$. An explicit upper bound on $m_{\exp}(p)$ is given by the inequality



TOexp(p) < TOexp,up(p) := 2( 1 - 2p)^^'' ^ ^^'^^^^ 



so that one has $m\ge m_{\exp}(p)$ provided that $m\ge m_{\exp,\mathrm{up}}(p)$. This simple upper bound on $m_{\exp}(p)$ is rather good: $m_{\exp,\mathrm{up}}(p)\sim m_{\exp}(p)\sim\frac12\ln\frac1p$ as $p\downarrow0$, $m_{\exp,\mathrm{up}}(p)\sim m_{\exp}(p)\to1$ as $p\uparrow\frac12$, and $\max_{0<p<1/2}\bigl(m_{\exp,\mathrm{up}}(p)/m_{\exp}(p)\bigr)=1.32\ldots$, the maximum attained at $p=0.019\ldots$.

On the other hand, the equation $p(k)=p$ has a unique solution $k=:k(p)$ in $(0,\infty)$ for every $p\in(0,\frac12)$. Moreover, the Newton iterative scheme $k_{j+1}:=k_j-F(k_j)/F'(k_j)$ for $j=0,1,\dots$, where $F(k):=F(p,k):=e^k-1-k+\bigl(1+k-2e^k+e^{2k}(1-k)\bigr)p$, converges to $k(p)$ for every $p\in(0,\frac12)$ and every sufficiently large initial approximation $k_0$. Thus, one has $m_{\exp}(p)=\tilde m(k(p))$ for all $p\in(0,\frac12)$. The following table illustrates the fact that $m_{\exp}(p)$ is substantially smaller than $m_*(p)$ for small $p$:
$$\begin{array}{l|llll}
p & 10^{-1} & 10^{-2} & 10^{-3} & 10^{-4}\\
\hline
m_*(p) & 1.75 & 5.06\ldots & 15.83\ldots & 50.00\ldots\\
m_{\exp}(p) & 1.21\ldots & 1.86\ldots & 2.72\ldots & 3.68\ldots
\end{array}$$
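The $m_{\exp}$ row of the table can be reproduced numerically. The sketch below is hedged in two ways: it assumes the parametrization $\tilde m(k)=k(1+e^k)/(2(e^k-1))$, $p(k)=(e^k-1-k)/\bigl((e^k-1)(1+k+(k-1)e^k)\bigr)$, which is consistent with the tabulated values, and it inverts $p(k)$ by plain bisection rather than by the Newton scheme described in the text. All function names are illustrative.

```python
import math

def p_of_k(k):
    """Parametric p(k); decreasing from 1/2 to 0 as k runs over (0, infinity)."""
    ek1 = math.expm1(k)                      # e^k - 1, accurate for small k
    return (ek1 - k) / (ek1 * (1.0 + k + (k - 1.0) * math.exp(k)))

def m_tilde(k):
    """Parametric value of m_exp at parameter k."""
    return k * (1.0 + math.exp(k)) / (2.0 * math.expm1(k))

def k_of_p(p, lo=1e-3, hi=50.0):
    """Solve p(k) = p by bisection; the bracket is adequate for p between about 1e-20 and 0.499."""
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if p_of_k(mid) > p:
            lo = mid                         # p(k) is decreasing, so the root lies to the right
        else:
            hi = mid
    return 0.5 * (lo + hi)

def m_exp(p):
    """Lower bound m_exp(p) under the stated assumptions; equals 1 for p >= 1/2."""
    return 1.0 if p >= 0.5 else m_tilde(k_of_p(p))

for p in (1e-1, 1e-2, 1e-3, 1e-4):
    print(p, m_exp(p))
```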



In particular, it follows that if $p$ is small then the second upper bound in (3.4)-(3.5), $e^{-nH}$, holds for significantly smaller values of $m$ than the first upper bound in (3.4)-(3.5), $c_{3,0}\,\mathsf{P}^{\mathrm{LC}}(T_n\ge x)$, does. Thus, the exponential upper bound, $e^{-nH}$, may turn out to be smaller than the generally more precise upper bound, $c_{3,0}\,\mathsf{P}^{\mathrm{LC}}(T_n\ge x)$, even for large values of $x$ if the $X_i$'s differ very much from one another in distribution. Details on this remark, Remark 3.5, will be presented elsewhere.

Remark 3.6. The stated results for the asymmetric case have "symmetric" counterparts, in which the standardized Bernoulli distribution $\mathrm{BS}(p)=p\,\delta_{\sqrt{q/p}}+q\,\delta_{-\sqrt{p/q}}$ is replaced by the standardized symmetric three-point distribution $\mathrm{ST}(p):=\frac p2\,\delta_{-1/\sqrt p}+(1-p)\,\delta_0+\frac p2\,\delta_{1/\sqrt p}$, where $\delta_a$ is the Dirac probability measure concentrated at one point $a$ and $p\in(0,1]$. (At that, the class $\mathcal{F}_+^3$ may be replaced by the larger, "two-tail" version $\mathcal{F}^3$. Note also that, since the distribution $\mathrm{ST}(p)$ is symmetric, one can assume without loss of generality that all generalized moment functions $f$ involved are even; indeed, for a symmetric r.v. $X$, one has $\mathsf{E}f(X)=\mathsf{E}g(X)$, where $g(x):=\frac12\bigl(f(x)+f(-x)\bigr)$.) Then the exact lower bound $m_*(p)$ on $m$ gets replaced by another exact lower bound, whose general expression is however more complicated than expression (2.3) for $m_*(p)$. In fact, the exact lower bound on $m$ (denoted here by $m_{\mathrm{st,Schur}}(p)$) for the symmetric-case analogues of the Schur-concavity statements in Theorem 2.6 turns out to be strictly greater for some values of $p$ than the exact lower bound on $m$ (denoted here by $m_{\mathrm{st}}(p)$) for the symmetric-case analogues of the above "asymmetric" results other than Schur-concavity.

A simple upper bound on $m_{\mathrm{st,Schur}}(p)$ and hence on $m_{\mathrm{st}}(p)$ for all $p\in(0,\frac12]$ is given by inequality

$$m_{\mathrm{st}}(p)\le m_{\mathrm{st,Schur}}(p)\le m_{\mathrm{st,high}}(p):=m_*(r)=\frac{5-3\sqrt{1-2p}-2p}{4\bigl(\sqrt{p/2}+1-\sqrt{1-2p}-p\bigr)},\tag{3.12}$$

where the function $m_*$ is defined by (2.3) and $r:=(1-\sqrt{1-2p})/2$ is the root in $(0,\frac12]$ of equation $2r(1-r)=p$. This follows because the convolution of the standardized Bernoulli distributions $\mathrm{BS}(r)$ and $\mathrm{BS}(1-r)$ is a symmetric three-point distribution, which ascribes probability $1-p$ to $\{0\}$ and therefore can be obtained from the standardized symmetric three-point distribution $\mathrm{ST}(p)$ by simple re-scaling. One can see that $m_{\mathrm{st,high}}(p)\to\infty$ as $p\downarrow0$ and $m_{\mathrm{st,high}}(\frac12)=1$; let also $m_{\mathrm{st,high}}(p):=1$ for all $p\in(\frac12,1)$.
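The closed form in (3.12) can be compared with $m_*(r)$ directly. The sketch below assumes the closed-form expression $\bigl(5-3\sqrt{1-2p}-2p\bigr)/\bigl(4(\sqrt{p/2}+1-\sqrt{1-2p}-p)\bigr)$, obtained by substituting $r=(1-\sqrt{1-2p})/2$ into (2.3) and using $r-r^2=p/2$; the function names are illustrative.

```python
import math

def m_star(p):
    """Exact lower bound m_*(p) of (2.3)."""
    if p >= 0.5:
        return 1.0
    return (1.0 + p + 2.0 * p * p) / (2.0 * (math.sqrt(p - p * p) + 2.0 * p * p))

def m_st_high(p):
    """Upper bound m_st,high(p) = m_*(r) with 2 r (1 - r) = p, for 0 < p <= 1/2."""
    r = (1.0 - math.sqrt(1.0 - 2.0 * p)) / 2.0
    return m_star(r)

def m_st_high_closed(p):
    """Closed form of m_*(r) in terms of p."""
    s = math.sqrt(1.0 - 2.0 * p)
    return (5.0 - 3.0 * s - 2.0 * p) / (4.0 * (math.sqrt(p / 2.0) + 1.0 - s - p))

for p in (0.01, 0.1, 0.3, 0.5):
    print(p, m_st_high(p), m_st_high_closed(p))
print(m_st_high(1e-6) * math.sqrt(2e-6))     # close to 1: growth like 1 / sqrt(2 p)
```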

A remarkable and not so difficult to prove fact is that

$$m_{\mathrm{st}}(p)=1\quad\forall p\in[p_*,1],\quad\text{where}\quad p_*:=\sqrt2-1=0.4142\ldots.\tag{3.13}$$

More generally, it appears that

$$m_{\mathrm{st}}(p)=m_1(p)\,I\{0<p<p_{0,1}\}+m_0(p)\,I\{p_{0,1}\le p<p_*\}+I\{p_*\le p<1\},$$
where po,i = 0.3878 . . . , mi{p) :— \ y ^p^' ™o(p) 2 log^ z ' where z is (for
each $p\in(0,p_*)$) the only root in the interval $(0,\sqrt2)$ of polynomial

$$\begin{aligned}
&(9p^2-24p+16)z^6+(-36p^2+120p-96)z^5+(36p^2-216p+240)z^4\\
&\quad+(-20p^3+60p^2+120p-320)z^3+(72p^3-168p^2+96p+240)z^2\\
&\quad+(-96p^3+144p^2-144p-96)z+4p^4+40p^3-44p^2+48p+16.
\end{aligned}$$

(Here, as usual, $I\{A\}$ denotes the indicator of an assertion $A$.)

On the other hand, one can show that a lower bound on $m_{\mathrm{st}}(p)$ and hence on $m_{\mathrm{st,Schur}}(p)$ is given by the inequality

$$m_{\mathrm{st,Schur}}(p)\ge m_{\mathrm{st}}(p)\ge m_{\mathrm{st,low}}(p):=\max\bigl(1,m_1(p),m_{\mathrm{low}}(p)\bigr)$$

$$=m_1(p)\,I\{0<p<p_{\mathrm{low},1}\}+m_{\mathrm{low}}(p)\,I\{p_{\mathrm{low},1}\le p<p_*\}+I\{p_*\le p<1\},\tag{3.14}$$

where piow,i = 0.3889... and toiow(p) ■— 2(i+iog^2(i+p)) • appears that this 
lower bound, $m_{\mathrm{st,low}}(p)$, is very close to $m_{\mathrm{st}}(p)$ and differs from the latter only for $p$ in the rather narrow interval $(p_{0,1},p_*)\approx(0.3878,0.4142)$, and at most by $m_{\mathrm{st}}(p_{\mathrm{low},1})-m_{\mathrm{st,low}}(p_{\mathrm{low},1})=0.0008598\ldots$. It follows from (3.14) that the lower bound $p_*=\sqrt2-1$ on $p$ in (3.13) is exact; that is, $m_{\mathrm{st}}(p)>1$ $\forall p\in(0,p_*)$. Also, the four bounds, $m_{\mathrm{st,low}}(p)\le m_{\mathrm{st}}(p)\le m_{\mathrm{st,Schur}}(p)\le m_{\mathrm{st,high}}(p)$, all equal 1 for all $p\in[\frac12,1)$ and are asymptotic to $1/\sqrt{2p}$ as $p\downarrow0$; moreover, the difference between any two of these four bounds goes to 0 as $p\downarrow0$, since it is easy to see that $m_{\mathrm{st,high}}(p)-m_{\mathrm{st,low}}(p)\to0$ as $p\downarrow0$.

In this "symmetric" setting, the place of supermartingales gets taken by
conditionally symmetric martingales (that is, martingales $(S_i)$ with $S_0 = 0$
a.s. and conditionally symmetric differences $X_i$), and the "bounded-asymmetry"
condition (3.7) in Corollary 3.4 gets replaced by the "bounded-kurtosis" condition

\[ \frac{b_i^2}{c_i^2} \le \frac1p \quad \forall i = 1,\ldots,n. \tag{3.15} \]

(Note that the kurtosis of a standardized symmetric three-point r.v. $ST \sim ST(p)$
is $1/p$.) Thus, one now has another, "symmetric", version of inequality (3.8),
which immediately implies a striking "symmetric" version of inequality (3.9):
namely, in view of (3.13) and (3.15), inequality (3.9) holds for all conditionally
symmetric martingales if the condition $b_i = c_i\ \forall i$ of i4j Theorem 1.3] is replaced
by the much less restrictive condition

\[ b_i^2 \le \frac{c_i^2}{\sqrt2 - 1} \approx 2.4142\, c_i^2 \quad \forall i = 1,\ldots,n. \]

In particular, one has the following generalizations of (1.1) and (1.5), respectively:

\[ \mathsf E f(a_1 ST_1 + \cdots + a_n ST_n) \le \mathsf E f(Z) \quad \forall f \in \mathcal F^3 \tag{3.16} \]

and

\[ \mathsf P(a_1 ST_1 + \cdots + a_n ST_n \ge x) \le \frac{2e^3}{9}\, \mathsf P(Z \ge x) \quad \forall x \in \mathbb R, \tag{3.17} \]

where the $ST_i$'s are i.i.d. with $ST_i \sim ST(p)$ and $p \in [\sqrt2 - 1, 1]$; note that ST(p) coincides with the
distribution of a Rademacher r.v. if $p = 1$.

Inequalities for the much simpler case when 6,; — b^i were obtained in [Sj |S] 
(in the "exponential" case) and in 

One also has multi-dimensional analogues of inequalities H3.16|l and (|3.17() . 
Namely, for all p e [\/2— 1 , 1] one has the following generalizations of inequalities 
((TT^ and (UnSl): 

E/(||SriXi + --- + Sr„x„||) E/(|Z|) for aU even / in (gjg) 

and 

P(||SriXi + --- + Sr„x„|| < ^ P(|Z| ^x) Va;eR, (3.19) 

with the Rademacher r.v.'s ei, . . . , replaced by STi, . . . , Sr„. 

Condition (3.15) may be interpreted as a condition of boundedness of the
kurtoses of the $X_i$'s; cf. the bounded-asymmetry conditions (2.6) and (2.9). Note
that the usual interpretation of the term "kurtosis" (as well as the term itself,
meaning "peakedness") is not quite adequate, especially in contexts of large
deviations. Indeed, without loss of generality, let us assume that a r.v. $X$ is
standardized, that is, zero-mean and unit-variance. Then the kurtosis of $X$ is
$\operatorname{kurt}(X) = \mathsf E X^4 = 1 + \mathsf E(X^2 - 1)^2 = 1 + \operatorname{Var}(X^2)$, which clearly is a measure of the
relative spread of the values of the nonnegative r.v. $X^2$ about its mean 1. Thus,
$\operatorname{kurt}(X)$ is large if and only if at least one of the two tails, left or right,
of the distribution of $X^2$ on $[0,\infty)$ is heavy. These two tails can be measured,
respectively, by the "negative" and "positive" parts of $\operatorname{kurt}(X) - 1$, namely
$\operatorname{kurt}_-(X) := \mathsf E (X^2-1)^2 I\{X^2 < 1\}$ and $\operatorname{kurt}_+(X) := \mathsf E (X^2-1)^2 I\{X^2 > 1\}$. If
$\operatorname{kurt}_-(X)$ is large, then the distribution of $X$ is peaked at 0; if $\operatorname{kurt}_+(X)$ is large,
then at least one tail of the distribution of $X$ is heavy. Of these two reasons for
the kurtosis of $X$ to be large, the heaviness of the tails of the distribution of
$X$ seems more important than the peakedness, at least where large deviations
are involved. Also, note that the kurtosis of $X$ can be infinite only if the tails
of the distribution of $X$ are heavy enough. As was noted, the kurtosis of a
standardized symmetric three-point r.v. $ST \sim ST(p)$ is $1/p$, which is decreasing
in $p$; at that, both the negative part $\operatorname{kurt}_-(ST) = 1 - p$ and the positive part
$\operatorname{kurt}_+(ST) = 4\sinh^2 \ln\frac{1}{\sqrt p}$ are decreasing in $p$. More generally, it is not difficult
to show that for any function $f \in \mathcal F^3$ (except for $f$ of the form $f(x) = a + bx^2$
for some nonnegative real constants $a$ and $b$) the generalized moments $\mathsf E f(ST)$
and hence $\mathsf E f(c_1 ST_1 + \cdots + c_n ST_n)$ are strictly decreasing in $p \in (0, 1]$, where
$ST, ST_i \sim ST(p)$ and the $c_i$'s are nonzero real constants. Thus, inequality (3.16)
for $p \in [\sqrt2 - 1, 1)$ not only generalizes but also strengthens inequality (1.1).
Similarly, inequality (3.17) is stronger than inequality (1.5) for $p \in [\sqrt2 - 1, 1)$
and large enough $x$.
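The kurtosis decomposition just described can be checked numerically. The following Python sketch assumes that ST(p) puts mass $p/2$ on each of $\pm 1/\sqrt p$ and mass $1 - p$ at 0 (the standardized symmetric three-point law); the function name is hypothetical. It computes $\operatorname{kurt}$, $\operatorname{kurt}_-$ and $\operatorname{kurt}_+$ directly from the atoms.

```python
import math


def st_kurtosis_parts(p: float):
    """Return (kurt, kurt_minus, kurt_plus) for ST(p), computed from its three atoms.

    Assumed law: P(ST = +-1/sqrt(p)) = p/2 each, P(ST = 0) = 1 - p.
    """
    atoms = [(-1 / math.sqrt(p), p / 2), (0.0, 1 - p), (1 / math.sqrt(p), p / 2)]
    kurt = sum(w * x**4 for x, w in atoms)
    kurt_minus = sum(w * (x * x - 1) ** 2 for x, w in atoms if x * x < 1)
    kurt_plus = sum(w * (x * x - 1) ** 2 for x, w in atoms if x * x > 1)
    return kurt, kurt_minus, kurt_plus


kurt, k_minus, k_plus = st_kurtosis_parts(0.25)
closed_form_plus = 4 * math.sinh(math.log(1 / math.sqrt(0.25))) ** 2
```

For $p = 1/4$ the atoms are $\pm 2$ and 0, so the three quantities can also be verified by hand: $\operatorname{kurt} = 4$, $\operatorname{kurt}_- = 3/4$, $\operatorname{kurt}_+ = 9/4$.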

It may also be instructive to compare inequality (3.17) with available asymptotic
results. For instance, it follows from ^B] or |^ that for every $p \in (0, 1)$

\[ \mathsf P(ST_1 + \cdots + ST_n \ge x) \sim \mathsf P(Z \ge x/\sqrt n) \tag{3.20} \]

provided that $n \to \infty$ and $x = o(n)$. The advantages of this asymptotics over
inequality (3.17) are (i) that (3.20) holds for every $p \in (0, 1)$ and not just for
$p \in [\sqrt2 - 1, 1]$ and (ii) there is no "extra" constant factor (such as $2e^3/9$) in the
right-hand side of (3.20). However, these advantages seem to be counterbalanced
by the following: (i) the asymptotic relation (3.20), without an upper bound
on the rate of convergence, is impossible to use in statistical practice when one
needs to be certain that the left-hand side of (3.20) does not exceed a prescribed
level; (ii) the asymptotics is valid only in the zone $x = o(n)$, and this zone is
defined only qualitatively; (iii) the summands $ST_1, \ldots, ST_n$ in the left-hand side
of (3.20) are identically distributed; if coefficients $a_1, \ldots, a_n$ are allowed in (3.20)
(as in (3.17)), they cannot differ too much from one another, and also then the zone
$x = o(n)$ must be narrowed; (iv) there is no "extra" constant factor (such as
$2e^3/9$) in the right-hand side of the generalized-moment comparison inequality
(3.16); (v) one can use the more precise upper bounds of the form given in (1.9)
or (fO-^ instead of (PT7| .

Details on this long remark, Remark 3.6, will be presented elsewhere.

Remark 3.7. Directions of further research indicated in Remarks 3.5 and 3.6 can
be combined. That is, one can at once replace (i) the class of generalized moment
functions considered above by the much narrower class of, in effect, all
nondecreasing exponential functions and (ii) the standardized
Bernoulli distribution by the standardized symmetric three-point distribution.
At that, one has another exact lower bound, say $m_{\mathrm{st,exp}}(p)$, on $m$. Obviously,
$m_{\mathrm{st,exp}}(p) \le m_{\mathrm{st}}(p)$ for all $p \in (0, 1]$. In particular, $m_{\mathrm{st,exp}}(p) = m_{\mathrm{st}}(p) = 1$
for all $p \in [\sqrt2 - 1, 1]$. Also, similarly to (3.12) and in view of (3.11), one has
$m_{\mathrm{st,exp}}(p) \le m_{\mathrm{exp}}(r) \le m_{\mathrm{exp,up}}(r)$ for all $p \in (0, \sqrt2 - 1)$ (and even for all
$p \in (0, \tfrac12]$), where as before $r = (1 - \sqrt{1 - 2p})/2$ is the root in $(0, \tfrac12]$ of the equation
$2r(1 - r) = p$. While, as noted, $m_{\mathrm{st,exp}}(p) = 1$ for all $p \in [\sqrt2 - 1, 1]$, one has
$m_{\mathrm{st,exp}}(p) > 1$ for all $p \in (0, \tfrac13)$. Indeed, if $m_{\mathrm{st,exp}}(p) = 1$ for some $p \in (0, 1)$,
then inequality (3.16) holds for that same $p$ and all $f$ in this narrower class. Taking now $n = 1$
and $f(x) = e^{\lambda x}$ in (3.16), and then letting $\lambda \downarrow 0$, one has $1/p = \mathsf E\, ST^4 \le \mathsf E Z^4 = 3$
(the moments of orders 1, 2, and 3 of $ST$ and $Z$ coincide), whence $p \ge \tfrac13$.

Details on this remark, Remark 3.7, will be presented elsewhere.



3.2. Bounds on self-normalized sums



(Details on the results presented in this subsection will be given elsewhere.)
Efron 53 considered the so-called self-normalized sum

\[ V := \frac{X_1 + \cdots + X_n}{\sqrt{X_1^2 + \cdots + X_n^2}}, \tag{3.21} \]

assuming that the $X_i$'s satisfy the orthant symmetry condition: the joint distribution
of $\delta_1 X_1, \ldots, \delta_n X_n$ is the same for any choice of signs $\delta_1, \ldots, \delta_n \in \{1, -1\}$,
so that, in particular, each $X_i$ is symmetric(ally distributed). It suffices that the
$X_i$'s be independent and symmetrically (but not necessarily identically) distributed.
On the event $\{X_1 = \cdots = X_n = 0\}$, let $V := 0$.

In Subsection 3.2 we assume throughout that the $X_i$'s are all non-degenerate:
$\mathsf P(X_i = 0) < 1$ for all $i$.

Note that the conditional distribution of any symmetric r.v. $X$ given $|X|$ is
the symmetric distribution on the (at most) two-point set $\{X, -X\}$. Therefore,
under the orthant symmetry condition, the distribution of $V$ is the mixture of
the distributions of the normalized Khinchin-Rademacher sums $a_1\varepsilon_1 + \cdots + a_n\varepsilon_n$,
where $a_i = X_i/(X_1^2 + \cdots + X_n^2)^{1/2}$, so that $a_1^2 + \cdots + a_n^2 = 1$ (except on the
event $\{X_1 = \cdots = X_n = 0\}$, where $a_1 = \cdots = a_n = 0$). Hence, by (1.1) (for
$f(x) = e^{\lambda x}$, $\lambda \ge 0$) and (1.2), one has

\[ \mathsf E e^{\lambda V} \le \mathsf E e^{\lambda Z} \quad \forall \lambda \ge 0 \tag{3.22} \]

and

\[ \mathsf P(V \ge x) \le e^{-x^2/2} \quad \forall x \ge 0. \tag{3.23} \]

These results can be easily restated in terms of Student's statistic $T$, which is a
monotonic function of $V$, as noted by Efron; namely, $T = \sqrt{\frac{n-1}{n}}\, V \big/ \sqrt{1 - V^2/n}$.
Inequalities iflT^ and were improved in [T^l HSl HZj using ((rT|) (for 

/ G 7i4_) and (|1.5|l . so that one has 

E/(y)<E/(Z) V/e7^^ (3.24) 

and 

\[ \mathsf P(V \ge x) \le \frac{2e^3}{9}\, \mathsf P(Z \ge x) \quad \forall x \in \mathbb R. \tag{3.25} \]

Multivariate analogues of these results, which can be expressed in terms of
Hotelling's statistic in place of Student's, were also obtained in [77] .

It was pointed out in Tf, Theorem 2.8] that, since the normal tail decreases
fast, inequality (3.25) implies that relevant quantiles of $V$ may exceed the
corresponding standard normal quantiles only by a relatively small amount, so
that one can use (3.25) rather efficiently to test symmetry even for non-i.i.d.
observations.
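To make the mixture argument concrete, here is a small Python sketch that computes the exact tail of one normalized Khinchin-Rademacher sum by enumerating all sign patterns, and compares it with the exponential bound $e^{-x^2/2}$ and with the bound $(2e^3/9)\,\mathsf P(Z \ge x)$. The coefficient vector `raw` is an arbitrary illustrative choice, not taken from the paper, and the helper names are hypothetical.

```python
import itertools
import math


def rademacher_tail(a, x):
    """Exact P(a_1 e_1 + ... + a_n e_n >= x) for independent Rademacher signs e_i."""
    n = len(a)
    hits = sum(
        1
        for signs in itertools.product((-1, 1), repeat=n)
        if sum(s * ai for s, ai in zip(signs, a)) >= x - 1e-12  # tolerance for rounding
    )
    return hits / 2**n


def normal_tail(x):
    """P(Z >= x) for a standard normal Z."""
    return 0.5 * math.erfc(x / math.sqrt(2))


raw = [3.0, 1.0, 4.0, 1.0, 5.0, 9.0, 2.0, 6.0]   # illustrative values only
norm = math.sqrt(sum(v * v for v in raw))
a = [v / norm for v in raw]                        # now a_1^2 + ... + a_n^2 = 1
C = 2 * math.e**3 / 9                              # the constant factor 2e^3/9

rows = []
for x in (0.5, 1.0, 1.5, 2.0, 2.5):
    rows.append((x, rademacher_tail(a, x), math.exp(-x * x / 2), C * normal_tail(x)))
```

Each row holds the point $x$, the exact tail, the exponential bound, and the normal-tail bound; since $V$ is a mixture of such sums, bounds that hold for every coefficient vector carry over to $V$.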

Here we shall present extensions of inequalities (3.24) and (3.25) to the case
when the $X_i$'s are not symmetric, as well as improvements of (3.24) and (3.25)
in the "symmetric case". Asymptotics for large deviations of $V$ for i.i.d. $X_i$'s
without moment conditions was obtained recently by Jing, Shao and Zhou [19].
Comments similar to the ones made at the end of Remark 3.6 apply here as
well.

3.2.1. The "asymmetric" case 

The basic idea here is to represent any zero-mean, possibly asymmetric distribution
as an appropriate mixture of two-point zero-mean distributions. Let us
assume at first that a zero-mean r.v. $X$ has an everywhere strictly positive density
function. Consider the truncated r.v. $X_{a,b} := X I\{a \le X \le b\}$. Then, for
every fixed $a \in (-\infty, 0]$, the function $b \mapsto \mathsf E X_{a,b}$ is continuous and increasing on
the interval $[0, \infty)$ from $\mathsf E X_{a,0} \le 0$ to $\mathsf E X_{a,\infty} > 0$. Hence, for each $a \in (-\infty, 0]$,
there exists a unique value $b \in [0, \infty)$ such that $\mathsf E X_{a,b} = 0$. Similarly, for each
$b \in [0, \infty)$, there exists a unique value $a \in (-\infty, 0]$ such that $\mathsf E X_{a,b} = 0$. That
is, one has a one-to-one correspondence between $a \in (-\infty, 0]$ and $b \in [0, \infty)$
such that $\mathsf E X_{a,b} = 0$. Denote by $r := r_X$ the reciprocating function defined on $\mathbb R$
and carrying this correspondence, so that

\[ \mathsf E X I\{X \text{ is between } x \text{ and } r(x)\} = 0 \quad \forall x \in \mathbb R; \]

the function $r$ is decreasing on $\mathbb R$ and such that $r(r(x)) = x\ \forall x \in \mathbb R$; moreover,
$r(0) = 0$. (Clearly, $r(x) = -x$ for all real $x$ if the r.v. $X$ is symmetric.) One also
has

\[ r(x) = x_-(G(x))\, I\{x > 0\} + x_+(G(x))\, I\{x < 0\}, \tag{3.26} \]

where $x_\pm(h)$ stand for the positive and negative roots $x$ of the equation $G(x) = h$
and, in turn,

\[ G(x) := \mathsf E\, |X|\, I\{|X| \le |x|,\ \operatorname{sign} X = \operatorname{sign} x\}. \tag{3.27} \]

Thus, the set $\{\{x, r(x)\}\colon x \in \mathbb R\}$ of (at-most-)two-point sets constitutes a
partition of $\mathbb R$. Moreover, the two-point set $\{x, r(x)\}$ is uniquely determined by the
distance $|x - r(x)|$ between the two points, as well as by the product $|x|\,|r(x)|$.
Now one can see that the conditional distribution of the zero-mean r.v. $X$ given
$W := |X - r(X)|$ (or, equivalently, $Y := |X r(X)|$) is the uniquely determined
zero-mean distribution on the two-point set $\{X, r(X)\}$. Thus, the distribution
of the zero-mean r.v. $X$ with an everywhere positive density is represented as
a mixture of two-point zero-mean distributions. This mixture is given rather
explicitly, provided that the distribution of the r.v. $X$ is known.
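The construction of the reciprocating function via (3.26) and (3.27) can be illustrated on a toy example. The Python sketch below assumes a zero-mean two-sided exponential density (rate `LAM_POS` on the positive half-line, rate `LAM_NEG` on the negative one, with weights chosen so that the mean is zero); this density, the parameter values, and all function names are hypothetical choices for illustration. For this family $G$ is available in closed form, and one can check by hand that $r(x) = -(\lambda_+/\lambda_-)\,x$ for $x > 0$, which gives an independent check of the numerical root-finding.

```python
import math

LAM_POS, LAM_NEG = 2.0, 0.5                 # hypothetical rates on (0, inf) and (-inf, 0)
W_POS = LAM_POS / (LAM_POS + LAM_NEG)       # P(X > 0), chosen so that E X = 0


def G(x: float) -> float:
    """G(x) = E |X| I{|X| <= |x|, sign X = sign x} for the toy density."""
    if x == 0:
        return 0.0
    lam, mass = (LAM_POS, W_POS) if x > 0 else (LAM_NEG, 1 - W_POS)
    u = lam * abs(x)
    return (mass / lam) * (1.0 - (1.0 + u) * math.exp(-u))


def root_of_G(h: float, sign: float, upper: float = 1e3) -> float:
    """Bisection for the root of G(x) = h on the half-line with the given sign."""
    lo, hi = 0.0, upper
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if G(sign * mid) < h:
            lo = mid
        else:
            hi = mid
    return sign * 0.5 * (lo + hi)


def r(x: float) -> float:
    """Reciprocating function as in (3.26): match G on the opposite half-line."""
    if x == 0:
        return 0.0
    return root_of_G(G(x), -1.0 if x > 0 else 1.0)


r_of_one = r(1.0)          # closed form for this toy density: -(LAM_POS / LAM_NEG) * 1
back_again = r(r_of_one)   # r is an involution, so this should return 1
```

Since $\mathsf E X I\{X \text{ is between } x \text{ and } r(x)\} = G(x) - G(r(x))$ for $x > 0$, matching the two values of $G$ is exactly the zero-mean condition on the two-point set $\{x, r(x)\}$.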

Thus, one has generalized versions of the self-normalized sum (3.21), which
require, instead of the symmetry of the independent r.v.'s $X_i$, only that the $X_i$'s
be zero-mean:

\[ V_W := \frac{X_1 + \cdots + X_n}{\tfrac12\sqrt{W_1^2 + \cdots + W_n^2}} \quad\text{and}\quad V_{Y,m} := \frac{X_1 + \cdots + X_n}{(Y_1^m + \cdots + Y_n^m)^{1/(2m)}}, \tag{3.28} \]

where $m \ge 1$,

\[ W_i := |X_i - r_i(X_i)| \quad\text{and}\quad Y_i := |X_i\, r_i(X_i)|, \]

and the reciprocating function $r_i := r_{X_i}$ is constructed as above, based on the
distribution of $X_i$, for each $i$, so that the reciprocating functions $r_i$ may be
different from one another if the $X_i$'s are not identically distributed. On the
event $\{X_1 = \cdots = X_n = 0\}$ (which is the same as either one of the events $\{W_1 =
\cdots = W_n = 0\}$ and $\{Y_1 = \cdots = Y_n = 0\}$), let $V_W := 0$ and $V_{Y,m} := 0$.
Note that $V_W = V_{Y,1} = V$ when the $X_i$'s are symmetric. Logan et al 22 and
Shao j!'{5| obtained limit theorems for the "symmetric" version of $V_{Y,m}$ (with
the reciprocating function $r(x) = -x$), whereas the $X_i$'s did not need to be
symmetric.

These constructions can be extended to the general case of any zero-mean r.v.
$X$, absolutely continuous or not. Here, one can use randomization (by means
of a r.v. uniformly distributed in the interval $(0, 1)$) to deal with the atoms of the
distribution of the r.v. $X$, and a modification of the inverse functions $x_\pm(h)$ to
deal with the intervals on which the distribution function of $X$ and hence the
function $G$ are constant. Namely, in general $r(X)$ is replaced by $r(X, U)$, where
$U$ is a r.v. uniformly distributed in the interval $(0, 1)$ and independent of $X$ and,
for $x \in \mathbb R$ and $u \in (0, 1)$,

\[
r(x, u) :=
\begin{cases}
x_-\bigl(G(x-) + u\,(G(x) - G(x-))\bigr) & \text{if } x \in [0, \infty), \\
x_+\bigl(G(x+) + u\,(G(x) - G(x+))\bigr) & \text{if } x \in (-\infty, 0],
\end{cases}
\]

\[ x_+(h) := \inf\{x \in [0, \infty]\colon G(x) \ge h\}, \qquad x_-(h) := \sup\{x \in [-\infty, 0]\colon G(x) \ge h\}. \]

By conditioning on the $W_i$'s or $Y_i$'s one obtains the following corollaries.
Corollary 3.8. (From results of [5^:)

\[ \mathsf E f(V_W) \le \mathsf E f(Z) \quad \forall f \in \mathcal H^5_+, \text{ whence} \tag{3.29} \]
\[ \mathsf P(V_W \ge x) \le c_{5,0}\, \mathsf P(Z \ge x) \quad \forall x \in \mathbb R, \tag{3.30} \]

where $c_{5,0} = 5!\,(e/5)^5 = 5.699\ldots$.

Corollary 3.9. (From Theorem 12 . 21 and Corollarv l3.1l ^ Suppose that for some 
p £ (0, 1) and all i £ {1, . . . ,n} 

I{X, > 0} s$ - a.s. (3.31) 



|r(X.)l ' ' P 

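Then for all $m \ge m_*(p)$

\[ \mathsf E f(V_{Y,m}) \le \mathsf E f(T_n) \quad \forall f \in \mathcal F^3_+ \text{ and} \tag{3.32} \]
\[ \mathsf P(V_{Y,m} \ge x) \le c_{3,0}\, \mathsf P^{\mathsf{LC}}(T_n \ge x) \quad \forall x \in \mathbb R, \tag{3.33} \]

where $T_n$ and $\mathsf P^{\mathsf{LC}}(T_n \ge x)$ have the same meaning as in Corollary 3.1 and
$c_{3,0} = 2e^3/9 = 4.4634\ldots$.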

Of course, one can replace the upper bound in inequalities like (3.33) by
either of the more precise (but slightly less transparent and more difficult to
compute) upper bounds given in (3.3) and (3.6).

By Remark 3.5, inequality (3.32) will continue to hold for all $m \ge m_{\mathrm{exp}}(p)$
provided that the class $\mathcal F^3_+$ is replaced by the smaller class considered in that remark; correspondingly,
for such $m$ one will have an exponential upper bound on $\mathsf P(V_{Y,m} \ge x)$ of the
form given in (3.5).

Condition (3.31) is likely to hold when the $X_i$'s are bounded i.i.d. r.v.'s.

Note that the reciprocating function $r$ depends on the (usually unknown in
statistics) distribution of the underlying r.v. $X$. However, if e.g. the $X_i$ constitute
an i.i.d. sample, then the function $G$ defined by (3.27) can be estimated based
on the sample, so that one can estimate the reciprocating function $r$. Thus,
replacing $X_1 + \cdots + X_n$ in the numerators of $V_W$ and $V_{Y,m}$ by $X_1 + \cdots + X_n - n\theta$,
one obtains approximate pivots to be used to construct confidence intervals
or, equivalently, tests for an unknown mean $\theta$. One can also use bootstrap to
estimate the distributions of such pivots.



3.2.2. The "symmetric" case 



Here we assume that $X_1, \ldots, X_n$ are independent symmetric r.v.'s. In this case,
we already have the upper bounds given by (3.24) and (3.25), which are based
on (1.1) and (1.5). As was pointed out, bounds (3.16) and (3.17) improve and
generalize (1.1) and (1.5). Correspondingly, the upper bounds given by (3.24)
and (3.25) can be improved and generalized as follows.

Introduce another, "symmetric" modification of the standard self-normalized
sum $V$ given by (3.21):

\[ V_{\mathrm{symm},m,p} := \frac{ST_1 X_1 + \cdots + ST_n X_n}{(|X_1|^{2m} + \cdots + |X_n|^{2m})^{1/(2m)}}, \tag{3.34} \]

where $p \in (0, 1)$, $m \ge 1$, $ST_1, \ldots, ST_n$ are i.i.d. ST(p) r.v.'s, independent also of
$X_1, \ldots, X_n$. On the event $\{X_1 = \cdots = X_n = 0\}$, let $V_{\mathrm{symm},m,p} := 0$.

Thus, the distribution of $V_{\mathrm{symm},m,p}$ is the mixture of the distributions of the
linear combinations of the form $a_1 ST_1 + \cdots + a_n ST_n$, where $a_i = X_i/(|X_1|^{2m} +
\cdots + |X_n|^{2m})^{1/(2m)}$, so that $|a_1|^{2m} + \cdots + |a_n|^{2m} = 1$ (except on the event $\{X_1 =
\cdots = X_n = 0\}$, where $a_1 = \cdots = a_n = 0$).

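By Remark 3.6, it follows that for all $m \ge m_{\mathrm{st}}(p)$

\[ \mathsf E f(V_{\mathrm{symm},m,p}) \le \mathsf E f\bigl(n^{-1/(2m)}(ST_1 + \cdots + ST_n)\bigr) \quad \forall f \in \mathcal F^3, \text{ whence} \tag{3.35} \]
\[ \mathsf P(V_{\mathrm{symm},m,p} \ge x) \le c_{3,0}\, \mathsf P^{\mathsf{LC}}\bigl(n^{-1/(2m)}(ST_1 + \cdots + ST_n) \ge x\bigr) \quad \forall x \in \mathbb R, \tag{3.36} \]

where $\mathsf P^{\mathsf{LC}}$ again denotes the least log-concave majorant of the corresponding
tail function. In particular (cf. (3.16) and (3.17)), for all $p \in [\sqrt2 - 1, 1)$,

\[ \mathsf E f(V_{\mathrm{symm},1,p}) \le \mathsf E f\bigl(n^{-1/2}(ST_1 + \cdots + ST_n)\bigr) \tag{3.37} \]
\[ \le \mathsf E f(Z) \quad \forall f \in \mathcal F^3, \tag{3.38} \]
\[ \mathsf P(V_{\mathrm{symm},1,p} \ge x) \le c_{3,0}\, \mathsf P^{\mathsf{LC}}\bigl(n^{-1/2}(ST_1 + \cdots + ST_n) \ge x\bigr) \tag{3.39} \]
\[ \le c_{3,0}\, \mathsf P(Z \ge x) \quad \forall x \in \mathbb R. \tag{3.40} \]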

In view of the mixture representations of the distributions of $V$ and $V_{\mathrm{symm},m,p}$
and the discussion after inequalities (3.16)-(3.19), inequalities (3.35)-(3.40) for
$V_{\mathrm{symm},m,p}$ generalize/improve inequalities (3.22)-(3.25) for $V$.

The classic self-normalized sum $V$ can obviously be used, e.g., as a test statistic
to test the symmetry of the distributions of the $X_i$'s. However, it is not seen
how $V_{\mathrm{symm},m,p}$ can be used in symmetry tests, because its distribution will
always be symmetric, even if the distributions of the $X_i$'s are not.

On the other hand, there are two issues with $V$:

(i) $V$ may have too light tails if the tails of the $X_i$'s are heavy enough, and
so, the symmetry test based on such an upper bound as the one given by
inequality (3.25) may turn out to be too conservative and hence lacking
some power;

(ii) the way $V$ (as well as its modifications considered so far: $V_W$, $V_{Y,m}$, and
$V_{\mathrm{symm},m,p}$) "deals" with the event when all the $X_i$'s take on the zero value
certainly seems to be too conservative; note that this event may naturally
occur with a nonzero probability if the distribution of $X_i$ is discrete or if
the original r.v.'s $X_i$ are replaced by the corresponding truncated r.v.'s
$X_i I\{|X_i| < b_i\}$ for some $b_i > 0$ (which may be done to increase the power
of the test). However, note that usually the probabilities $p_i := \mathsf P(X_i \ne 0)$
will be close to (even if less than) 1.

To try to resolve these two issues with $V$, we shall suggest yet another
modification of it. Each of the other self-normalized sums introduced above ($V_W$,
$V_{Y,m}$, and $V_{\mathrm{symm},m,p}$) can be modified in the same manner.

Let us accompany any r.v. $X$ with a r.v. of the form

\[ \tilde X := X + \hat X\, I\{X = 0\}, \tag{3.41} \]

where $\hat X$ is a r.v. which is independent of $X$ and whose distribution coincides
with the conditional distribution of $X$ given that $X \ne 0$; thus, $\tilde X = X$ if $X \ne 0$
and $\tilde X = \hat X$ if $X = 0$, so that $\mathsf P(\tilde X \ne 0) = 1$. It is not hard to see that,
if $X$ is symmetric, then the conditional distribution of $X$ given $|\tilde X|$ coincides
with the symmetric distribution $\frac p2\,\delta_{\tilde X} + (1 - p)\,\delta_0 + \frac p2\,\delta_{-\tilde X}$ on the three-point set
$\{\tilde X, 0, -\tilde X\}$, where $p := \mathsf P(X \ne 0)$:

\[ \mathcal L\bigl(X \bigm| |\tilde X|\bigr) = \tfrac p2\,\delta_{\tilde X} + (1 - p)\,\delta_0 + \tfrac p2\,\delta_{-\tilde X}. \tag{3.42} \]

Moreover, the distribution of $\tilde X$ coincides (just as that of $\hat X$ does) with the
conditional distribution of $X$ given that $X \ne 0$. Indeed, $\tilde X$ is equal in distribution
to the first nonzero member of an infinite random sequence $(X^{(1)}, X^{(2)}, \ldots)$, if
$X, X^{(1)}, X^{(2)}, \ldots$ are i.i.d. r.v.'s; therefore, one may assume that $\tilde X$ equals (as
a r.v., and not just in distribution) the first nonzero member of the sequence
$(X, X^{(1)}, X^{(2)}, \ldots)$. Thus, roughly speaking, to get $\tilde X$, one samples from the
distribution of $X$ till getting a nonzero replica of $X$. Note also that for any even
function $g$ such that $g(0) = 0$ one has

\[ \operatorname{Var} g(\tilde X) - \operatorname{Var} g(X) = \frac{1-p}{p}\Bigl(\operatorname{Var} g(X) - \frac1p\,[\mathsf E g(X)]^2\Bigr). \tag{3.43} \]

It follows from (3.42) that the distribution of yet another modification of the
self-normalized sum $V$,

\[ \tilde V_{\mathrm{symm},m,p} := \frac{X_1 + \cdots + X_n}{\sqrt p\,(|\tilde X_1|^{2m} + \cdots + |\tilde X_n|^{2m})^{1/(2m)}}, \tag{3.44} \]

is the mixture of the distributions of the linear combinations of the form 
aiSTi(pi) + • • • + a„ST„(p„), where the Sri(pi)'s are independent r.v.'s such 
that ^i{pi) - ST(p,) for all i and = + • • • + |X„|2")27^, so that 

laip'" + . . . + |a„|2™ = 1 a.s.; here, p S (0, 1), 

p ^ P(X, ^ 0) Vi, 

and each $\tilde X_i$ is produced based on $X_i$ according to formula (3.41), where the
$\hat X_i$'s are independent of one another and of the $X_i$'s.

Recall that, for each $i$, one has $\mathsf P(\tilde X_i = 0) = 0$ and the distribution of $\tilde X_i$
coincides with the conditional distribution of $X_i$ given that $X_i \ne 0$.

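It follows that

\[ \mathsf E f(\tilde V_{\mathrm{symm},m,p}) \le \mathsf E f\bigl(n^{-1/(2m)}(ST_1 + \cdots + ST_n)\bigr) \quad \forall f \in \mathcal F^3, \tag{3.45} \]

whence

\[ \mathsf P(\tilde V_{\mathrm{symm},m,p} \ge x) \le c_{3,0}\, \mathsf P^{\mathsf{LC}}\bigl(n^{-1/(2m)}(ST_1 + \cdots + ST_n) \ge x\bigr) \quad \forall x \in \mathbb R, \tag{3.46} \]

provided that

\[ m \ge m_{\mathrm{st}}(p); \]

in particular, if $p \ge \sqrt2 - 1$ (which will typically be the case),

\[ \mathsf E f(\tilde V_{\mathrm{symm},1,p}) \le \mathsf E f\bigl(n^{-1/2}(ST_1 + \cdots + ST_n)\bigr) \tag{3.47} \]
\[ \le \mathsf E f(Z) \quad \forall f \in \mathcal F^3, \tag{3.48} \]
\[ \mathsf P(\tilde V_{\mathrm{symm},1,p} \ge x) \le c_{3,0}\, \mathsf P^{\mathsf{LC}}\bigl(n^{-1/2}(ST_1 + \cdots + ST_n) \ge x\bigr) \tag{3.49} \]
\[ \le c_{3,0}\, \mathsf P(Z \ge x) \quad \forall x \in \mathbb R; \tag{3.50} \]

here again the $ST_i$'s are i.i.d. ST(p) r.v.'s.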

Note that $\operatorname{ess\,sup} \tilde V_{\mathrm{symm},1,p} = \sqrt{n/p}$, while $\operatorname{ess\,sup} V = \sqrt n$. Thus, the tails of
$\tilde V_{\mathrm{symm},1,p}$ are longer in some sense than those of $V$, so that the symmetry test
based on an inequality such as (3.49) or (3.50) may be less conservative and
hence more powerful than the corresponding test based on an inequality such as
(3.25), especially if the size of the test is small enough. An interesting question
is how the generalized moments $\mathsf E f(\tilde V_{\mathrm{symm},m,p})$ compare with $\mathsf E f(V)$ for $f \in \mathcal F^3$.

It may also be of interest to compare $\mathsf E f(\tilde V_{\mathrm{symm},m,p})$ with $\mathsf E f(V_{\mathrm{symm},m,p})$
(assuming that $p_i = p$ for all $i$). It seems that neither of them dominates the
other one in general. In view of (3.43), it seems likely that $\mathsf E f(\tilde V_{\mathrm{symm},m,p})$ will
be greater than $\mathsf E f(V_{\mathrm{symm},m,p})$ if the tails of the distributions of the $X_i$'s are
not too heavy: then the variability of the $|\tilde X_i|$'s will be less than that of the
$|X_i|$'s and hence, heuristically, the tails of $\tilde V_{\mathrm{symm},m,p}$ will be heavier than those
of $V_{\mathrm{symm},m,p}$.

Anyway, a definite advantage of $\tilde V_{\mathrm{symm},m,p}$ (over $V$ and $V_{\mathrm{symm},m,p}$) is that the
denominator of its ratio expression in (3.44) is nonzero a.s. On the other hand,
an obvious disadvantage of $\tilde V_{\mathrm{symm},m,p}$ is that to compute its value one needs
to know the distributions of the $X_i$'s. In statistical practice, $\tilde V_{\mathrm{symm},m,p}$ may be
approximated (at least in the case when the $X_i$'s are i.i.d.) by replacing $\hat X_i$ for
each $i$ by $X_{J(i)}$, where $J(1), \ldots, J(n)$ are (conditionally, given $X_1, \ldots, X_n$) i.i.d.
r.v.'s, each $J(i)$ having the uniform distribution on the set

\[ \{j \in \{1, \ldots, n\}\colon X_j \ne 0\}; \]

this set is nonempty with a probability close to 1 if $p$ is close to 1 or $n$ is large;
however, if this set happens to be empty, one can just set $X_{J(i)} := 0$, so that one
has $\tilde X_i = 0$. It would be interesting to compare generalized moments and tails
of this "practical" version of $\tilde V_{\mathrm{symm},m,p}$ with those of $\tilde V_{\mathrm{symm},m,p}$ itself.
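The resampling recipe just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `v_tilde_practical` is hypothetical, $p$ is treated as a user-supplied parameter, the denominator follows the ratio form $\sqrt p\,(\sum_i |\tilde X_i|^{2m})^{1/(2m)}$, and returning 0 for an all-zero sample is a convention chosen here (the text does not fix one).

```python
import math
import random


def v_tilde_practical(xs, m=1.0, p=1.0, rng=None):
    """'Practical' version of the modified self-normalized sum.

    Zero observations are replaced, in the denominator only, by values drawn
    uniformly at random from the nonzero observations of the same sample.
    """
    rng = rng or random.Random(0)            # fixed seed so the sketch is reproducible
    nonzero = [x for x in xs if x != 0]
    if not nonzero:
        return 0.0                           # assumed convention for an all-zero sample
    filled = [x if x != 0 else rng.choice(nonzero) for x in xs]
    denom = math.sqrt(p) * sum(abs(x) ** (2 * m) for x in filled) ** (1 / (2 * m))
    return sum(xs) / denom                   # the numerator uses the original observations


no_zeros = v_tilde_practical([3.0, 4.0])                    # reduces to V when p = 1, m = 1
with_zero = v_tilde_practical([1.0, -2.0, 0.0, 3.0], p=0.75)
all_zero = v_tilde_practical([0.0, 0.0])
```

With $m = 1$ the Cauchy-Schwarz inequality gives $|\text{value}| \le \sqrt{n/p}$, matching the essential supremum mentioned earlier in the text.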

4. Proofs 

4.1. Statements of lemmas and proofs of the main results 

The proofs of the main results are preceded in this subsection by some definitions
and a series of lemmas. At least one of them (Lemma 4.1.12) may be of independent
interest. The proofs of the lemmas are deferred further to Subsection 4.2.
Let us introduce more classes of functions, in addition to the classes Tii^, J-^^ 
Tt, and Q"^ (recall |TTJ, (j^T^ . fTTIjl . and (j^T^ : 

g\ := {/: 3a e M,6 e M,/i e Ti^ g ]g j^^,) = a ^ h x ^ h(x)\, (4.1) 
:= {/: 3a e M,6 ^ 0,/i e Vx e R /(.x) = a + 6 x + /i(x)}. (4.2) 

Remark 4.1. It is not difficult to see that, if a function $f$ is in any of the
classes of functions defined above, then the shifted function $x \mapsto f(x + a)$ is also
in the same class, for any real constant $a$. That is, all these classes of functions
are shift-invariant.

Lemma 4.1.1. Suppose that a function $g\colon \mathbb R \to \mathbb R$ is convex and such that
there exists a finite limit $g(-\infty) := \lim_{x \to -\infty} g(x)$; in particular, the latter
condition will obviously be the case if $g$ is nonnegative and nondecreasing. Then
$g'(-\infty) = 0$, where $g'$ is the right derivative of $g$.

Lemma 4.1.2. If a is a natural number then coincides with the class Ti." 
of all functions / : M — > R such that the derivative /("^i) is everywhere finite 
and convex, and /'*'•'(— 00) — ■ ■ ■ — /^"^"'^•' (— 00) = 0. Moreover, if f (z Ti.", then 
all the functions f'^'^\ ■ ■ ■ , /("^^) are nonnegative. 
Lemma 4.1.3. Let $f\colon \mathbb R \to \mathbb R$ be a function such that $f''$ is finite, nonnegative,
nondecreasing, and convex, with $f''(-\infty) = 0$. Then $f \in \operatorname{cl} \mathcal G^3_+$. If, moreover, $f$
is nondecreasing, then $f \in \operatorname{cl} \mathcal G^3_{++}$.

Lemma 4.1.4. One has J-\ = c\Q\^. 

Lemma 4.1.5. One has = cl^'^, where is defined by H2.15|l . 

Lemma 4.1.6. If f E T"^, then either f{x) — 0{x) as x oo or 
liminfj^^oo G (0, oo]. 

Lemma 4.1.7. If f E T\, then f(x) — 0(|a;|) as x ^ —oo. 
Lemma 4.1.8. ^G^. 

Proposition 4.2. There exists a function g E \ Q\. Since G\j^ Q G\, it 
follows that T\ ^ ^++- (This proposition complements Lemmas 14 . 1 . 4l and l4 . 1 . 81 
it will not be used elsewhere in this paper.) 

The following two lemmas are essentially well known. Their statements (and 
proofs) are given here for easy reference. 

Lemma 4.1.9. (Cf. e.g. and @1 Lemma 4.3].) Let $X$ be a r.v. such that
$\mathsf E X \le 0$ and $-a \le X \le b$ a.s. for some positive real numbers $a$ and $b$. Let
$BS \sim BS(p)$ with

\[ p = \frac{a}{a + b}. \]

Then

\[ \mathsf E f(X) \le \mathsf E f(\sqrt{ab}\, BS) \]

for any nondecreasing convex function $f$, and hence for any function $f \in \mathcal F^3_+$.
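Lemma 4.1.9 can be sanity-checked on a small discrete example. The Python sketch below assumes the convention that BS(p) takes the value $\sqrt{q/p}$ with probability $p$ and $-\sqrt{p/q}$ with probability $q = 1 - p$, so that $\sqrt{ab}\,BS$ with $p = a/(a+b)$ lives on $\{-a, b\}$; the distribution of `X` below is a made-up example with values in $[-a, b]$ and nonpositive mean, and the test functions are $f_t(x) = (x - t)_+$.

```python
import math


def bs_atoms(p: float):
    """Atoms (value, probability) of the standardized Bernoulli law BS(p) (assumed convention)."""
    q = 1 - p
    return [(math.sqrt(q / p), p), (-math.sqrt(p / q), q)]


def expect(f, atoms):
    """Expectation of f under a finite discrete distribution."""
    return sum(w * f(x) for x, w in atoms)


a, b = 1.0, 3.0
p = a / (a + b)
scale = math.sqrt(a * b)
two_point = [(scale * x, w) for x, w in bs_atoms(p)]       # supported on {-a, b}

# Hypothetical X with values in [-a, b] and mean -0.05 <= 0.
X = [(-1.0, 0.5), (-0.5, 0.2), (0.5, 0.1), (2.5, 0.2)]

gaps = []
for k in range(-20, 61):
    t = k / 20
    f = lambda x, t=t: max(x - t, 0.0)                      # nondecreasing and convex
    gaps.append(expect(f, two_point) - expect(f, X))        # should be >= 0
```

The inequality here follows from two elementary facts: a convex function lies below its chord on $[-a, b]$, and a nondecreasing function turns the nonpositive mean into the right direction.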

Lemma 4.1.10. If $X$ is a zero-mean r.v., then $\mathsf E f(cX)$ is nondecreasing in
$c \ge 0$ for any convex function $f$ and hence for any $f \in \mathcal F^3_+$.

Lemma 4.1.11. Let BS(p) ^ BS(p). Then e{p) := E/(BS(p)) is nonincreasing 
in p E (0,1) for any f E H\_ and so, by p. 41) . for any f E 'H\, whence, by 
Lemma \4.1.4\ for any f E T'l. 

The extension from Theorem 2.2 to Theorem 2.4 to Theorem 2.5 is based in
part on the following simple lemma, which may be of independent interest.

Lemma 4.1.12. Suppose that for every $i \in \{1, \ldots, n\}$ one has (2.8) and (2.9).
Then

\[ \mathsf E f(S_n) \le \mathsf E f(c_1 BS_1 + \cdots + c_n BS_n) \tag{4.3} \]

for any f E 7i^, and so, by p.4|l . for any f E Ti.'^, whence, by Lemma \4.1.4\ 
and Lebesgue's dominated convergence theorem, for any f E T^. 
For $m \ge 1$, introduce

\[ p_*(m) := \frac{2m + 1 - \sqrt{4(m-1)(m+2) + 1}}{4(2m - 1)} = \frac{2}{(2m - 1)\bigl(2m + 1 + \sqrt{4(m-1)(m+2) + 1}\bigr)}, \tag{4.4} \]

so that $p_*(m) \in (0, \tfrac12]$. Introduce also

5i{u, CP, m) := 2c(l - c^"-^)^ + 2pc(l - c^'"-!) + 0^(1 - c^"-^). 
52{u,c,p,m) :=(l-p)(l-c2"-i)u2 

+ 2c(l - c2"-2)y + 2pc(l - c2"-l) + c2(l - c2"-3); 

53(u,c,p,m) := -c2™-1m2_2(c2™-1-cp + c2»w 

+ ((2c + c2 - 2c2" - c2™+i)p - c2™-i); 
Si{u,c,p,m) :=(l-c2"-i)p(l + c + ?i)'. 

Lemma 4.1.13. For any given pair (p, m) such that m ^ 1 and p G (0, 1), the 

following two statements are equivalent to each other. 

(i) (Cf. Statement (|V|l in Theorem 12.61 ) For every every function f G Ti.^, 
the function 

[0,00)" 9 {a^,a2) ^ E /(a^^'^^i + a^/^'^'BS^), 

is Schur- concave, where BS^ BS(p), i = 1,2. 

(ii) for all u CzM. and c G (0, 1), one has the inequalities 

Si{u, c,p, m) I{u ^ 0} ^ 0; 
52{u,c,p,m) I{-c < w ^ 0} ^ 0; 
^3(u,c,p, m) J{-1 < u ^ -c} ^ 0. 

Lemma 4.1.14. For aZZ w ^ 0, c G (0,1), p G [p*,l), and m ^ 1, one /las 
5i(it) := Si{u,c,p,m) ^ 0. 

Lemma 4.1.15. For all u G [— c, 0], c G (0, 1), p G [p,, 1), anc? m ^ 1, one has 
52{u) := (52(u,c,p, m) ^ 0. 

Lemma 4.1.16. For all u G [— 1,— c], c G (0,1), p G [p*,l), and m ^ I, one 
has ^3(u) :— 5^lu,c,p,m) ^ 0. 

Lemma 4.1.17. (Recall (2.3) and (4.4).) For $p \in (0, 1)$ and $m \ge 1$, one has

\[ m \ge m_*(p) \iff p \ge p_*(m). \]

Lemma 4.1.18. In the context of Theorem, \2.b\ implication =J> (jljl is 

trae. 
Proof of Theorem 2.6. It suffices to prove the implications

(m=»iivi)=»(iivt^mii=»(iiTi=^m and 

jni) dvnj ivD =^ CH). ^ ' ' 

CJ 101: Suppose that condition m > m*(p) of item iQJ takes place. By 

Lemma [4.1.171 this condition is equivalent to p ^ j'*(m). Now statement (EJ 
with n\ in place of J^f follows from Lemmas OTH 14. 1 .141 [4. 1 . 151 and l4.1.16l 
Hence, by the definition H4.2|l of Q\j^^ one has (0 with Q\j^ in place of !F\. To 
complete the proof of implication lP) => (0), it remains to use Lemma [4. 1.41 
recall that the r.v.'s BSi each take on only finitely many (namely, two) values. 

^ By the well-known result by Muirhead jH] (see, e.g., US 

Remark B.l of Chapter 2]), a function of n nonnegative arguments is Schur- 
concave iff it is Schur-concave in any two of its arguments. Now implication 
ifvjl (|IV(I follows in view of R,emark l4.1l on page 123 by conditioning on all 

of the r.v.'s BSi, . . . , BS„ except any given two of them. 

((IV|| =4> (jni): Let here oi := cf a„ := c^™ and a := (oiH |-a„)/n, 

so that s^™) — a^/(^™^ Note that (ai,...,a„) ^ (a,..., a). Now implication 

n 

(|IVJ ^ (HH) follows. 

(HD HIII|) : This implication is trivial. 

(|lll|l =^ (P): This implication is true by Lemma 14.1.181 

(|lHl =^ IjVIip : This implication follows immediately from Lemma [4. 1.1 21 

(|VIIII IVII) : This implication is trivial. 

(|Vip iQlJ: This implication is also trivial. □ 

Proof of Theorem 2.7. Replace all functions $f$ in Theorem 2.6 with their
reflections $f^-$ defined by $f^-(x) := f(-x)$ for all real $x$, replace all r.v.'s $X$ there
with $-X$, and interchange $p$ with $q$, $a_i$ with $b_i$, and $A_{i-1}$ with $B_{i-1}$. Note that
(i) $f^-(-X) = f(X)$; (ii) $\mathsf E X \ge 0 \iff \mathsf E(-X) \le 0$; (iii) $X \sim BS(p) \iff
-X \sim BS(q)$; and (iv) $(S_0, \ldots, S_n)$ is a supermartingale with $S_0 \le 0$ a.s. iff
$(-S_0, \ldots, -S_n)$ is a submartingale with $-S_0 \ge 0$ a.s. Thus, Theorem 2.7 follows
immediately from Theorem 2.6. □

Proof of Proposition \2.fA This follows immediately from Lemmas l4 . 1 . 5l and l4 . 1 . 81 

□ 

Proof of Theorem 2.10. It suffices to prove the same implications, (4.5), as in
the proof of Theorem 2.6, only with the changes stated in the formulation of
Theorem 2.10. Below, all these implications are understood in the context of
Theorem 2.10. The proofs of most of these implications are similar to their
proofs in the context of Theorem 2.6. Below, only the most significant changes
are described.

© =^ 10): To prove this implication, in view of Theorems 2.6 and 2.7
and Proposition 2.9, it suffices to verify that the function (2.12) is Schur-concave
when $f(x) = x^2$ (for all real $x$) and $n = 2$. Thus, it suffices to verify that, for


imsart ver. 2005/10/19 file: arxiv_v2.tex date: February 2, 2008 



losif Pinelis/ Inequalities for sums of asymmetric random variables 






any given $m \ge 1$, the expression

\[ \mathsf E\bigl(BS_1 \cos^{1/m}\theta + BS_2 \sin^{1/m}\theta\bigr)^2 = \cos^{2/m}\theta + \sin^{2/m}\theta \]

is nondecreasing in $\theta \in [0, \pi/4]$. But this is easy to see.
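The monotonicity claimed here can be seen from the derivative, $\frac{2}{m}\sin\theta\cos\theta\,\bigl(\sin^{2/m - 2}\theta - \cos^{2/m - 2}\theta\bigr) \ge 0$ on $(0, \pi/4]$ for $m \ge 1$, and it is also easy to confirm numerically. The Python sketch below evaluates $\cos^{2/m}\theta + \sin^{2/m}\theta$ on a grid; the function name and the grid size are arbitrary illustrative choices.

```python
import math


def second_moment(theta: float, m: float) -> float:
    """cos^(2/m)(theta) + sin^(2/m)(theta), i.e. the second moment in the display above."""
    return math.cos(theta) ** (2 / m) + math.sin(theta) ** (2 / m)


def is_nondecreasing_on_grid(m: float, steps: int = 200, tol: float = 1e-12) -> bool:
    """Check monotonicity of theta -> second_moment(theta, m) on a grid in [0, pi/4]."""
    values = [second_moment(k * (math.pi / 4) / steps, m) for k in range(steps + 1)]
    return all(later >= earlier - tol for earlier, later in zip(values, values[1:]))


checks = {m: is_nondecreasing_on_grid(m) for m in (1.0, 1.5, 2.0, 5.0)}
```

For $m = 1$ the expression is identically 1, which is why a small tolerance is used in the comparison.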

=^ (P) : This impHcation follows from Theorems 12.61 and 12.71 and the 
observation that both classes and are contained in . 

inil =^ (j^nj: In view of Theorems 2.6 and 2.7, Proposition 2.9, and
Lebesgue's dominated convergence theorem, it suffices to verify that inequality
(2.11) holds when $f(x) = x^2$ for all real $x$ and $(S_0, \ldots, S_n)$ is a martingale as
described in the formulation of Theorem 2.10. Note that inequality (4.3) holds
for the function $f_0(x) := x_+^2$ in place of $f$, since $f_0$ is in the class covered by Lemma 4.1.12. It also holds for the
function $f_0(x) := (-x)_+^2$ in place of $f$, given that $(S_0, \ldots, S_n)$ is a martingale
as described. Thus, (4.3) holds when $f(x) = x^2$ for all real $x$. It remains to note
that

\[ \mathsf E(c_1 BS_1 + \cdots + c_n BS_n)^2 = n\,(s^{(1)})^2 \le n\,(s^{(m)})^2 = \mathsf E\bigl(s^{(m)} \cdot (BS_1 + \cdots + BS_n)\bigr)^2, \]

so that one does have inequality (2.11) when $f(x) = x^2$ for all real $x$. □



4.2. Proofs of the lemmas



Proof of Lemma 4.1.1. The convexity of $g$ implies $g(0) - g(x) \ge g'(x)(-x)$ and
$g(2x) - g(x) \ge g'(x)\,x$, so that $|g'(x)|\,|x| \le \max\bigl(|g(0) - g(x)|, |g(2x) - g(x)|\bigr)$
for all $x \in \mathbb R$. Letting now $x \to -\infty$ and using the existence of the finite limit
$g(-\infty)$, one has $g'(-\infty) = 0$. □

Proof of Lemma \4.1.^ This lemma was stated essentially as Proposition 1.1 in 
|31|. The proof given here is a little more detailed. Assume first that / e , 
so that f{x) = J{x — t)'^d^i{t) for a function ^ e A^", whence /'■"^^'(x) = 
a! fix — t)+ d^{t) is convex as a limit of linear combinations with nonnega- 
tive coefficients of convex functions x {x — The conditions /(*'^(— 00) = 
• • • = f^°'~^\—oo) = follow by Lebesgue's dominated convergence theorem. 
Thus, / £ Ti'^. Moreover, it is clear that all the functions f^^\ . . . , /(""^^ are 
nonnegative. 

Assume now that / G TL°^_. Consider first the case a = 1. Then / is convex 
and $f(-\infty) = 0$. Hence, by Lemma 4.1.1, one has $f'(-\infty) = 0$. Therefore,

$$f(x) = \int_{-\infty}^{x} f'(u)\,du = \int_{-\infty}^{x} du \int_{-\infty}^{u} df'(v) = \int (x - v)_+\,d\mu(v),$$

by Fubini's theorem, where $\mu := f'$; thus, $f \in \mathcal{H}^1_+$. The case of any natural
a 5^ 2 can now be treated by induction, in a similar manner. Indeed, if / € 
for a natural a ^ 2, then /' e 7Y"~^, by the definition of 7i". Hence, for a 
function /i G Al", 

so that / e H". □ 
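The representation $f(x) = \int (x-v)_+\,d\mu(v)$ with $\mu := f'$, obtained above for $\alpha = 1$, can be illustrated numerically. The sketch below is an assumption-laden example, not the paper's method: it takes $f(x) = \log(1 + e^x)$, which is convex with $f(-\infty) = 0$, and approximates the integral by the trapezoidal rule on a truncated range.

```python
import math

def softplus(x):
    # f(x) = log(1 + e^x): convex, with f(-inf) = 0 and f'(-inf) = 0
    return math.log1p(math.exp(x))

def mu_density(v):
    # density of mu := f', i.e. the derivative of the logistic function f'(v)
    s = 1.0 / (1.0 + math.exp(-v))
    return s * (1.0 - s)

def integral_representation(x, lower=-40.0, steps=200_000):
    # trapezoidal approximation of the integral of (x - v)_+ d mu(v);
    # the integrand vanishes for v > x, and the tail below `lower` is negligible
    h = (x - lower) / steps
    total = 0.0
    for k in range(steps + 1):
        v = lower + k * h
        weight = 0.5 if k in (0, steps) else 1.0
        total += weight * (x - v) * mu_density(v)
    return total * h
```

For instance, `integral_representation(1.0)` agrees with `softplus(1.0)` up to the discretization error of the quadrature.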
Proof of Lemma 4.1.3. For the given function $f$ and any $y \in \mathbb{R}$, introduce the functions defined by the formulas

$$f_{2,y}(x) := \bigl(f''(y) + f'''(y)(x - y)\bigr)_+\,\mathrm{I}\{x \le y\} + f''(x)\,\mathrm{I}\{x > y\}; \tag{4.6}$$

$$f_y(x) := \Bigl(f(y) + f'(y)(x - y) + \int_{-\infty}^{y} f_{2,y}(u)\,(u - x)_+\,du\Bigr)\mathrm{I}\{x \le y\} + f(x)\,\mathrm{I}\{x > y\} \tag{4.7}$$

for all real $x$. Here $f'''$ denotes the right derivative of the convex function $f''$, so that $f'''$ is nondecreasing. Note that

$$f_y'' = f_{2,y}. \tag{4.8}$$

Since $f''$ is convex, one has

$$f''(x) \ge f''(y) + f'''(y)(x - y) \tag{4.9}$$

for all $y$ and $x$; also, it is given that $f''$ is nonnegative; it follows that

$$f'' \ge f_{2,y}. \tag{4.10}$$

Observe that, moreover, the family of functions $(f_{2,y})$ is nonincreasing in $y \in \mathbb{R}$. Indeed, let $y$ and $y_1$ be any real numbers such that $y_1 < y$. Then $f_{2,y_1} = f'' \ge f_{2,y}$ on $[y_1, \infty)$, in view of (4.6) and (4.10). Recalling (4.9) and the fact that $f'''$ is nondecreasing, one has the inequalities $f''(y_1) \ge f''(y) + f'''(y)(y_1 - y)$ and $f'''(y_1)(x - y_1) \ge f'''(y)(x - y_1)$ for all $x \le y_1$; adding these inequalities, one sees that $f''(y_1) + f'''(y_1)(x - y_1) \ge f''(y) + f'''(y)(x - y)$. It follows, in view of (4.6), that $f_{2,y_1} \ge f_{2,y}$ on the interval $(-\infty, y_1]$ as well, and hence on the entire real line.

Using integration-by-parts/Fubini's theorem as in the proof of Lemma 4.1.2, one can verify that for any function $g \in C^2$ and all real $y$ and $x$

$$g(x) = \Bigl(g(y) + g'(y)(x - y) + \int_{-\infty}^{y} g''(u)\,(u - x)_+\,du\Bigr)\mathrm{I}\{x \le y\} + g(x)\,\mathrm{I}\{x > y\}. \tag{4.11}$$

By (4.8), for any real $w$ one has $f_w \in C^2$, so that one can substitute $f_w$ for $g$ in (4.11). In fact, let us do so for $w \in \{y, y_1\}$, again assuming that $y_1 < y$. At that, by (4.7), one has $f_{y_1} = f = f_y$ on the interval $[y, \infty)$ and hence $f_{y_1}(y) = f_y(y)$ and $f_{y_1}'(y) = f_y'(y)$. Now, since the family of functions $(f_y'')_{y\in\mathbb{R}} = (f_{2,y})_{y\in\mathbb{R}}$ is nonincreasing, one can see that the family $(f_y)_{y\in\mathbb{R}}$ is nonincreasing as well. Next, since $\mathrm{I}\{x \le y\} \to 0$ and $\mathrm{I}\{x > y\} \to 1$ for each $x$ as $y \to -\infty$, one concludes, in view of (4.8), that for any decreasing sequence $(y_n)$ in $\mathbb{R}$ converging to $-\infty$ one has $f_{y_n} \to f$, in the sense of Definition 2.8.

It remains to verify that for every real $y$ one has $f_y \in \mathcal{G}^3_+$ and, moreover, $f_y \in \mathcal{G}^3_{++}$ in the case when $f$ is known to be nondecreasing. Observe that

$$f_{2,y}(x) = f'''(y)\,(x - z)\,\mathrm{I}\{z \le x \le y\} + f''(x)\,\mathrm{I}\{x > y\}, \tag{4.12}$$

where

$$z := y\,\mathrm{I}\{f'''(y) = 0\} + \Bigl(y - \frac{f''(y)}{f'''(y)}\Bigr)\mathrm{I}\{f'''(y) \ne 0\}.$$

Indeed, $f'''$ is nonnegative and nondecreasing (since $f''$ is nondecreasing and convex). Hence, in the case when $f'''(y) = 0$, one has $f''' = 0$ on the entire interval $(-\infty, y]$. This and the condition $f''(-\infty) = 0$ imply $f'' = 0$ on the entire interval $(-\infty, y]$, so that $f''(y) = 0$. Now one sees that expressions (4.6) and (4.12) both equal $f''(x)\,\mathrm{I}\{x > y\}$ in the case when $f'''(y) = 0$. In the other case, when $f'''(y) \ne 0$, one has $f'''(y) > 0$ (since $f'''$ is nonnegative). Also, here $f''(y) + f'''(y)(x - y) = f'''(y)\,(x - z)$, whence (4.12) again follows.
Now, for the right derivative $f_{2,y}'$ of $f_{2,y}$, (4.12) yields

$$f_{2,y}'(x) = f'''(y)\,\mathrm{I}\{z \le x \le y\} + f'''(x)\,\mathrm{I}\{x > y\}.$$

Since $f'''$ is nonnegative and nondecreasing, it follows now that $f_{2,y}'$ is nondecreasing. Therefore, $f_{2,y}$ is convex. That is, by (4.8), $f_y''$ is convex. Also, (4.12) and (4.8) show that $f_y'' = 0$ on the interval $(-\infty, z]$. This means that

$$f_y(x) = a + bx \quad\text{for some real constants $a$ and $b$ and all $x \le z$.} \tag{4.13}$$

Let now

$$h_y(x) := f_y(x) - (a + bx)$$

for all real $x$. Then $h_y'' = f_y''$ is convex. Moreover, $h_y = 0$ on the interval $(-\infty, z]$, so that $h_y(-\infty) = h_y'(-\infty) = h_y''(-\infty) = 0$. By Lemma 4.1.2, $h_y \in \mathcal{H}^3_+$. Thus, $f_y \in \mathcal{G}^3_+$.
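If, moreover, $f$ is nondecreasing, then $f' \ge 0$. Hence and because (in view of (4.10)) $f''(u) \ge f_{2,y}(u) \ge f_{2,y}(u)\,\mathrm{I}\{u > x\}$ for all $x$, $y$, and $u$, one has

$$f_y'(x) = \Bigl(f'(y) - \int_{-\infty}^{y} f_{2,y}(u)\,\mathrm{I}\{u > x\}\,du\Bigr)\mathrm{I}\{x \le y\} + f'(x)\,\mathrm{I}\{x > y\}$$
$$\ge f'(-\infty)\,\mathrm{I}\{x \le y\} + f'(x)\,\mathrm{I}\{x > y\} \ge 0$$

for all $x$. Now (4.13) implies $b \ge 0$. Since $h_y \in \mathcal{H}^3_+$, one finally sees that $f_y \in \mathcal{G}^3_{++}$. □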
Proof of Lemma 4.1.4. First note that $\operatorname{cl}\mathcal{F}^3_+ = \mathcal{F}^3_+$, because the pointwise convergence preserves both the monotonicity and the convexity.
Next, take any $f \in \mathcal{G}^3_{++}$, so that

$$f(x) = a + bx + \int (x - t)_+^3\,d\mu(t)$$

for all $x$, where $a \in \mathbb{R}$, $b \ge 0$, and $\mu$ is nondecreasing and $\int (-t)_+^3\,d\mu(t) < \infty$. It follows that $f$ is nondecreasing and convex, since the functions $x \mapsto (x - t)_+^3$ are so. Similarly, $f''$ is nondecreasing and convex, since $f''(x) = 6\int (x - t)_+\,d\mu(t)$. That is, $f \in \mathcal{F}^3_+$ for any $f \in \mathcal{G}^3_{++}$, so that $\mathcal{G}^3_{++} \subseteq \mathcal{F}^3_+$, whence $\operatorname{cl}\mathcal{G}^3_{++} \subseteq \operatorname{cl}\mathcal{F}^3_+ = \mathcal{F}^3_+$.

It remains to show that $\mathcal{F}^3_+ \subseteq \operatorname{cl}\mathcal{G}^3_{++}$. Take any $f \in \mathcal{F}^3_+$. Then, by definition (2.1), $f$ and $f''$ are nondecreasing and convex. Hence, $f'$ is nonnegative, nondecreasing, and convex. Now Lemma 4.1.1 yields $f''(-\infty) = 0$. Also, $f''$ is nonnegative, since $f$ is convex. Thus, by Lemma 4.1.3, $f \in \operatorname{cl}\mathcal{G}^3_{++}$. □
Proof of Lemma 4.1.5. First note that $\operatorname{cl}\mathcal{F}^3 = \mathcal{F}^3$, because the pointwise convergence preserves the convexity.

Next, it is trivial that $\mathcal{G}^3 \subseteq \mathcal{F}^3$, whence $\operatorname{cl}\mathcal{G}^3 \subseteq \operatorname{cl}\mathcal{F}^3 = \mathcal{F}^3$.

It remains to show that $\mathcal{F}^3 \subseteq \operatorname{cl}\mathcal{G}^3$. Take any $f \in \mathcal{F}^3$. Then, by definition (2.14), $f$ and $f''$ are convex. The latter condition implies that at least one of the following three cases must take place: $f''$ is nondecreasing on $\mathbb{R}$, or $f''$ is nonincreasing on $\mathbb{R}$, or $f''$ switches from nonincreasing to nondecreasing.

Case 1: $f''$ is nondecreasing on $\mathbb{R}$. Since $f$ is convex, $f'' \ge 0$ on $\mathbb{R}$. Hence, there exists the limit $c := f''(-\infty) \in [0, \infty)$. Let

$$g(x) := f(x) - c\,x^2/2$$

for all real $x$. Then $g'' = f'' - c = f'' - f''(-\infty) \ge 0$, since $f''$ is nondecreasing. Also, $g'' = f'' - c$ is nondecreasing and convex, since $f''$ is so. In addition, $g''(-\infty) = 0$. Therefore, by Lemma 4.1.3, $g \in \operatorname{cl}\mathcal{G}^3_+$. That is, there exists a sequence $(g_n)$ of functions such that $g_n \to g$ and

$$g_n(x) = a_n + b_n x + h_n(x)$$

for all real $x$, where, for each $n$, $a_n$ and $b_n$ are real constants and $h_n$ is a function in $\mathcal{H}^3_+$; then $h_n \in \mathcal{F}^3_+$ (because $\mathcal{H}^3_+ \subseteq \mathcal{F}^3_+$, as seen, for example, from the proof of Lemma 4.1.2). Let now

$$f_n(x) := c\,x^2/2 + g_n(x) = c\,x^2/2 + a_n + b_n x + h_n(x)$$

for all $x$ and $n$. Then $g_n \to g$ implies $f_n \to f$. Moreover, for every $n$ one has $f_n \in \mathcal{G}^3$. Indeed, if $b_n < 0$, then the function $h_n$ belongs to $\mathcal{F}^3_+$ and the function $x \mapsto a_n + b_n x$ belongs to $\mathcal{F}^3_-$; and if $b_n \ge 0$, then the function $x \mapsto a_n + b_n x + h_n(x)$ belongs to $\mathcal{F}^3_+$ and the function $x \mapsto 0$ belongs to $\mathcal{F}^3_-$. Thus, $f \in \operatorname{cl}\mathcal{G}^3$ for any $f \in \mathcal{F}^3$ satisfying the condition of Case 1.

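Case 2: $f''$ is nonincreasing on $\mathbb{R}$. This case reduces to Case 1 by considering the function $x \mapsto \tilde f(x) := f(-x)$ in place of $f$. Indeed, if $f \in \mathcal{F}^3$ and $f''$ is nonincreasing, then $\tilde f \in \mathcal{F}^3$ and $\tilde f''$ is nondecreasing. Moreover, $f_n \in \mathcal{G}^3 \iff \tilde f_n \in \mathcal{G}^3$, where $\tilde f_n(x) := f_n(-x)$ for all real $x$.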

Case 3: There exists some real $x_0$ such that $f''$ is nonincreasing on $(-\infty, x_0]$ and nondecreasing on $[x_0, \infty)$. Here without loss of generality (w.l.o.g.) $x_0 = 0$. Let

$$g(x) := f(x) - f'(0)\,x - f''(0)\,x^2/2$$

for all real $x$, so that $g(0) = f(0)$ and $g'(0) = g''(0) = 0$; moreover, $g'' = f'' - f''(0)$ is convex on $\mathbb{R}$ (since $f''$ is so), nonincreasing on $(-\infty, 0]$, and nondecreasing on $[0, \infty)$, whence $g'' \ge 0$. Let, for all real $x$,

$$h_+(x) := g''(x)\,\mathrm{I}\{x > 0\} \quad\text{and}\quad h_-(x) := g''(x)\,\mathrm{I}\{x \le 0\},$$

so that $h_+ + h_- = g''$, $h_\pm \ge 0$, $h_+$ is nondecreasing, and $h_-$ is nonincreasing. Also, $h_+$ and $h_-$ are convex, since $g''$ is convex and $g'' \ge g''(0) = 0$. Let further, for all real $x$,

$$H_+(x) := \int (x - t)_+\,h_+(t)\,dt \quad\text{and}\quad H_-(x) := \int (t - x)_+\,h_-(t)\,dt,$$

so that $H_\pm(0) = 0$,

$$H_+'(x) = \int \mathrm{I}\{x > t\}\,h_+(t)\,dt = \int_0^x g''(t)\,dt\;\mathrm{I}\{x > 0\} = g'(x)\,\mathrm{I}\{x > 0\};$$
$$H_+''(x) = g''(x)\,\mathrm{I}\{x > 0\} = h_+(x);$$
$$H_-'(x) = -\int \mathrm{I}\{t > x\}\,h_-(t)\,dt = -\int_x^0 g''(t)\,dt\;\mathrm{I}\{x \le 0\} = g'(x)\,\mathrm{I}\{x \le 0\};$$
$$H_-''(x) = g''(x)\,\mathrm{I}\{x \le 0\} = h_-(x).$$

It follows that $H_+' + H_-' = g'$, whence for all real $x$ one has $H_+(x) + H_-(x) = g(x) - g(0)$, that is,

$$f(x) = f(0) + f'(0)\,x + f''(0)\,x^2/2 + H_+(x) + H_-(x). \tag{4.14}$$

Also, $H_+'' = h_+$ is nonnegative, nondecreasing, and convex, and hence $H_+$ is also convex. Also, since $h_+ \ge 0$, the first expression for $H_+'(x)$ above shows that $H_+' \ge 0$. Thus, $H_+$ and $H_+''$ are nondecreasing and convex; that is, $H_+ \in \mathcal{F}^3_+$. Similarly, $H_- \in \mathcal{F}^3_-$.

Note also that $f''(0) \ge 0$, since $f$ belongs to $\mathcal{F}^3$ and is hence convex. If $f'(0) < 0$, then the function $H_+$ belongs to $\mathcal{F}^3_+$ and the function $x \mapsto f(0) + f'(0)\,x + H_-(x)$ belongs to $\mathcal{F}^3_-$; and if $f'(0) \ge 0$, then the function $x \mapsto f(0) + f'(0)\,x + H_+(x)$ belongs to $\mathcal{F}^3_+$ and the function $H_-$ belongs to $\mathcal{F}^3_-$. Thus, (4.14) implies that $f \in \operatorname{cl}\mathcal{G}^3$ for any $f \in \mathcal{F}^3$ satisfying the condition of Case 3.

One concludes that, in all cases, $f \in \mathcal{F}^3$ implies $f \in \operatorname{cl}\mathcal{G}^3$. That is, $\mathcal{F}^3 \subseteq \operatorname{cl}\mathcal{G}^3$ indeed. □

Proof of Lemma 4.1.6. Let $f \in \mathcal{F}^3_+$.

Case 1: $f'' = 0$ on $\mathbb{R}$. Then there exist real $a$ and $b$ such that $f(x) = a + bx$ for all real $x$, so that $f(x) = O(x)$ as $x \to \infty$.

Case 2: there exists some $t \in \mathbb{R}$ such that $f''(t) \ne 0$. By (2.1), $f''$ is nonnegative (because $f$ is convex) and nondecreasing. Hence, $f''(t) > 0$ and $f''(x) \ge f''(t)$ for all $x \ge t$. It follows that $f(x) \ge f(t) + f'(t)\,(x - t) + f''(t)\,(x - t)^2/2$ for all $x \ge t$, whence $\liminf_{x\to\infty} f(x)/x^2 \ge f''(t)/2 > 0$. □

Proof of Lemma 4.1.7. Let $f \in \mathcal{F}^3_+$. Since $f$ is nondecreasing, one has $f(x) \le f(0)$ for all $x \le 0$. On the other hand, $f(x) \ge f(0) + f'(0)\,x$ for all real $x$, since $f$ is convex. It follows that $|f(x)| \le |f(0)| + |f'(0)\,x|$ for all $x \le 0$, so that $f(x) = O(|x|)$ as $x \to -\infty$. □

Proof of Lemma 4.1.8. For all real $x$, let

$$f(x) := \tfrac{8}{3}\,(1 - x)^{3/2}\,\mathrm{I}\{x < 0\} + \bigl(\tfrac{8}{3} - 4x + x^2 + \tfrac{1}{6}\,x^3 + x^4\bigr)\,\mathrm{I}\{x \ge 0\}. \tag{4.15}$$

Then it is easy to see that $f \in \mathcal{F}^3$.

Suppose that $f \in \mathcal{G}^3$. Then, by (2.15), there exist $c \ge 0$, $f_+ \in \mathcal{F}^3_+$, and $f_- \in \mathcal{F}^3_-$ such that for all real $x$

$$f(x) = c\,x^2/2 + f_+(x) + f_-(x).$$

Let $x \to -\infty$. Then, by Lemma 4.1.7, $f_+(x) = O(|x|)$, while by Lemma 4.1.6 either $f_-(x) = O(|x|)$ or $\liminf_{x\to-\infty} f_-(x)/x^2 > 0$. It follows that either $f(x) = O(|x|)$ as $x \to -\infty$ or $\liminf_{x\to-\infty} f(x)/x^2 > 0$. However, neither of these two alternatives is compatible with the fact that $f(x) = \tfrac{8}{3}\,(1 - x)^{3/2}$ for $x \le 0$. Thus, $f \in \mathcal{F}^3 \setminus \mathcal{G}^3$. □
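The two properties of the function in (4.15) that drive this proof, namely smooth gluing at $0$ and growth of order $|x|^{3/2}$ at $-\infty$, can be illustrated numerically. The sketch below assumes the coefficients of (4.15) to be $8/3$, $-4$, $1$, $1/6$, $1$ (a reading of the formula under which $f$, $f'$, $f''$, $f'''$ match at $0$); the helper names are hypothetical.

```python
def f(x):
    # the function of (4.15), under the assumed reading of its coefficients
    if x < 0:
        return (8.0 / 3.0) * (1.0 - x) ** 1.5
    return 8.0 / 3.0 - 4.0 * x + x ** 2 + x ** 3 / 6.0 + x ** 4

def second_difference(h):
    # one-sided second difference quotient of f at 0 (h > 0: right, h < 0: left)
    return (f(2.0 * h) - 2.0 * f(h) + f(0.0)) / (h * h)

def growth_ratios(x):
    # for x < 0: the ratios f(x)/|x| and f(x)/x^2
    return f(x) / abs(x), f(x) / (x * x)

ratio_linear, ratio_quadratic = growth_ratios(-1e8)
```

For large negative $x$ the first ratio is large and the second is small, so $f$ is neither $O(|x|)$ nor bounded below by a positive multiple of $x^2$ as $x \to -\infty$.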

Proof of Proposition Let g :— /', where / is defined by (|4.15(l . Then it is 
easy to see that g e 

Suppose that g ^ Q\. Then, by H4.1|l , there exist real a and b and h e TL^. such 
that for all real x one has g{x) = a + bx + h{x) and hence g'{x) — b + h'{x). By 
Lemma 4.1.2, $h(-\infty) = h'(-\infty) = 0$. Hence, $b = g'(-\infty)$. But, by inspection, $g'(-\infty) = 0$. It follows that $b = 0$, and so, $g(-\infty) = a + h(-\infty) = a \in \mathbb{R}$. This contradicts the fact that $g(-\infty) = -\infty$. □

Proof of Lemma 4.1.9. Write

$$X = \frac{X + a}{b + a}\,b + \frac{b - X}{b + a}\,(-a).$$

Let $f$ be any nondecreasing convex function. The convexity (together with the condition $-a \le X \le b$ a.s.) implies that

$$f(X) \le \frac{X + a}{b + a}\,f(b) + \frac{b - X}{b + a}\,f(-a) = \mathsf{E} f\bigl(\sqrt{ab}\,\mathrm{BS}\bigr) + \frac{f(b) - f(-a)}{b + a}\,X$$

a.s. Now, since $f$ is nondecreasing and $\mathsf{E} X \le 0$, the lemma follows. □
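A small numerical instance of this chord argument is sketched below. It is an illustration under stated assumptions: $\sqrt{ab}\,\mathrm{BS}$ is taken to be the zero-mean two-point r.v. with values $b$ and $-a$ and probabilities $a/(a+b)$ and $b/(a+b)$, and the particular $f$, $a$, $b$, and distribution of $X$ are arbitrary choices.

```python
def chord_bound(f, a, b, x):
    # value at x of the chord of f through (-a, f(-a)) and (b, f(b))
    return (x + a) / (b + a) * f(b) + (b - x) / (b + a) * f(-a)

def two_point_mean(f, a, b):
    # E f(Y) for the zero-mean Y with P(Y = b) = a/(a+b), P(Y = -a) = b/(a+b)
    return (a * f(b) + b * f(-a)) / (a + b)

def f(x):
    # an arbitrary nondecreasing convex test function
    return max(x + 1.0, 0.0) ** 3

a, b = 1.0, 3.0
values = [-1.0, -0.5, 2.0]   # support of X, inside [-a, b]
probs = [0.5, 0.3, 0.2]      # probabilities, chosen so that E X <= 0
mean_x = sum(p * x for p, x in zip(probs, values))
lhs = sum(p * f(x) for p, x in zip(probs, values))   # E f(X)
rhs = two_point_mean(f, a, b)                        # two-point bound
```

The pointwise chord inequality holds at every support point, and taking expectations gives `lhs <= rhs` because $f(b) \ge f(-a)$ and $\mathsf{E}X \le 0$.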

Proof of Lemma 4.1.10. Since $f$ is convex, the function $[0,\infty) \ni c \mapsto f(cX)$ is convex as well. Hence, the function $[0,\infty) \ni c \mapsto g(c) := \mathsf{E} f(cX)$ is convex. Since $\mathsf{E} X = 0$, one has $g(c) \ge g(0)$ for all real $c$, by Jensen's inequality. Therefore, the right derivative of $g$ is nonnegative at $0$ and hence on $[0,\infty)$. Now the lemma follows. □
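The monotonicity of $c \mapsto \mathsf{E} f(cX)$ can be observed on a concrete example. The sketch below is illustrative only: it takes $X \sim \mathrm{BS}(p)$, assumed to have the values $\sqrt{q/p}$ and $-\sqrt{p/q}$ with probabilities $p$ and $q = 1 - p$, and the convex function $f_t(x) = (x - t)_+^3$; the function names are hypothetical.

```python
import math

def expected_f(c, p, t):
    # E f_t(c * BS) for f_t(x) = (x - t)_+^3 and the assumed two-point law of BS(p)
    q = 1.0 - p
    up, down = math.sqrt(q / p), -math.sqrt(p / q)
    f = lambda x: max(x - t, 0.0) ** 3
    return p * f(c * up) + q * f(c * down)

def is_nondecreasing_in_c(p, t, c_max=3.0, points=301, tol=1e-12):
    # checks monotonicity of c -> E f_t(c * BS) on a grid in [0, c_max]
    grid = [c_max * k / (points - 1) for k in range(points)]
    values = [expected_f(c, p, t) for c in grid]
    return all(b >= a - tol for a, b in zip(values, values[1:]))
```

For $t < 0$ both atoms contribute near $c = 0$ and the derivative of $c \mapsto \mathsf{E} f_t(c\,\mathrm{BS})$ at $0$ vanishes because $\mathsf{E}\,\mathrm{BS} = 0$; convexity then gives the monotonicity on $[0,\infty)$.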

Proof of Lemma \4.1.1l\ In view of the definition of the class 'H\, it suffices to 
verify the statement of the lemma for all functions / of the form ft{x) := (x—t)'^, 
for all real t, so that e{p) = E(BS(p) — t)'^. Then 

K 0. 



□ 



Proof of Lemma \4.1.1!^ W.l.o.g., the BS^'s are independent of the SiS. For 



0, 1, . . . , n and / G TYT , introduce 



F^-.^Ef [Si + c,;+iBSi+i H 1- c„BS„) . 

Recall that, by Remark 14.11 the classes 7i" arc invariant with respect to the 
shifts. Hence, by Lemmas 1^:^14. 1.1 01 and 14. 1.1 II 

Fi = E Ei_i f{Si-i + X.i + Cj+iBSi+i H h c„BS„) 



< E Ei_i /(S*!-! + Ai-iBi-i BSi + Q+iBS,:+i H h c„BS,: 

^ E Ei_i f{Si-i + Ci ffii + Ci+iBSi+i H h c„BS„) 

< E Ei_i f{Si-i + CiBSi + Ci+iBSi+i H h c„BS„) 
for i = 1, . . . ,n, where Ei_i denotes the conditional expectation given the a- 
algebra G,;_i generated by and (BS^+i, . . . ,BS„), and the conditional dis- 

tribution of BSi given G^-i is BS(pi), with 

so that Pi ^ p, according to (|2.9|l . Hence, 

$$\mathsf{E} f(S_n) = F_n \le F_0 \le \mathsf{E} f(c_1\mathrm{BS}_1 + \dots + c_n\mathrm{BS}_n);$$

the last inequality follows because Sq ^ a.s. and all functions / G 7{\_ are 
nondecreasing. □ 

Proof of Lemma \4.1.1^ Statement (i) is equivalent to the following: for every 

1 G M, the function 

[0, oo)^ 3 (ai, 02) I — > et,p,m{ai,a2) := E /t(a}/^^'"''BSi + a2^*'^"-'BS2) 

is Schur-concave, where BSi ''^^^ BS(p) and 

ft{x) ■.= l{x-t)l. 
Using the homogeneity property 

et,p,m(Aai, Aa2) — A"^/^^™^ ex-i/i^^) t,p,m(ai, 02) 

for every $\lambda > 0$, one may assume w.l.o.g. that $a_1 + a_2 = 1$, so that $a_1 = \cos^2\theta$ and $a_2 = \sin^2\theta$, for some $\theta \in [0, \pi/2]$; moreover, in view of the same homogeneity property, one may replace here the i.i.d. standardized Bernoulli r.v.'s $\mathrm{BS}_1$ and $\mathrm{BS}_2$ with i.i.d. centered Bernoulli r.v.'s $\mathrm{BC}_1$ and $\mathrm{BC}_2$, such that

$$\mathsf{P}(\mathrm{BC}_i = q) = p = 1 - \mathsf{P}(\mathrm{BC}_i = -p), \quad i = 1, 2.$$

Therefore, statement (i) is equivalent to 

^p,n{9,t) :=ae(E/t(BCi cos^/" + BC2 sini/" 0)) (4.16) 

being nonnegative for all 9 G [0,7r/4] and all t G M (where do :~ d/d9), which 
is in turn equivalent to 

Al,P,n^(^,^i) , cosi-i/™^ sini-i/™0 

{l-p)p 

X Ap,„(6l, -u-p cos^/" 9-p sin^/" 9) 

= A2,p,™(cosi/'" 9, sini/™ 9, u) (4.17) 
being nonnegative for all 9 G [0, 7r/4] and w G M, where 

A2,p,m(ci, C2, u) : = 

_ (c^™-! _ cl^~^)qu\ - (c^^-lq + c^-^p) (ci + u)\ 

+ (c2™-lg + C^™- V) (C2 + U)l + {cX"-^ - C^™-1) p (Cl + C2 + U)\. 
Now, in view of the homogeneity relation 

A2,p,m(ci,C2,u) = cf"+^ A2,p,,„(l,C2/ci,u/ci), where < C2 < ci, 
statement (i) reduces to A2,p,m(l, c, u) being nonnegative for all c e (0, 1) and 
It remains to note that 



A2,p,m(l,C, U) = < 



(5i(u,c,p, m) if u ^ 0, 

S2{u,c,p,m) if u £ [—c,Q], 

63{u,c,p,m) if ue[-l,-c], (4-18) 

S4{u,c,p,m) if 1 — c, — 1], 

otherwise, 



and S4^{u,c,p,m) is manifestly nonnegative for all c G (0, 1), u G M, p G (0, 1), 
and m > 1. □ 

Proof of Lemma 4.1.14. Note that $\delta_1(u) \ge \delta_1(0)$ for $u \ge 0$, $c \in (0,1)$, and $m \ge 1$. Next, (4.18) shows that $\delta_1(0) = \delta_2(0)$. Now the lemma follows immediately from Lemma 4.1.15. □

Proof of Lemma \4-1.15\ W.l.o.g., m > 1. Note that 

dpS2{u,c,p,m) = (2c-m2)(1-c2™-1) >0 
for u G [— c, 0], c G (0, 1), and m > 1, so that w.l.o.g. 

p = p^ ^p^{m). 

Next, (52 (u) is a convex quadratic polynomial, whose minimum over all u G M is 
attained at 

c(l-c^-"^) 
u = u.[c^p,rr^) :=-^-_5_^^^. 

Hence, it suffices to show that 7(c,p*) is nonnegative for all c G [0, 1] and m > 1, 
where 

7(c,p) := d2{u^{c,p,m),c,p,m) 

c 

= _2(1 - c2'»-l) V + (1 - c2™-l)(2 - C + c2"-2 _ 2c2™-l)p - c2™-2(^ _ 

The main idea in the proof of this lemma is to replace here the entry of p^ with 
the equivalent (for p = p*), first-degree in p polynomial expression according to 
the identity 

to obtain 

{2m - 1)^7(0, p,) = f{c,p^), 
where 

f{c,p) (1 - c2™-i)2 - (2m - 1)^(1 - cfc^"'-'^ 
+ p (1 - c2"-i)(2m - l)((2m - 3)(1 - c^™"!) - (2m - 1) c(l - c^""-^)). 

It sufSces to show that f{c,p) ^ for all p e (0,1), m > 1, and c e (0,1). 
Introduce 



9{c) 


:=/(c,p)/c2™; 


9i{c) 


:=g'(c)/c2'"-^ 


52(c) 


:=5'i(c)c2; 


53(c) 


:=.gi(c)/ci-2'" 


54(c) 


:=53(c)/c^~^'" 


55(c) 


:=g;(c)/c2'"-4 



Then, letting 

s := m — 1 > 0, 

one has 5^(0) = Sc'^-^^ s (1 + s)(l + 2s)2(l + 4s)(l - p + 4s2p) > for aU 
$c \in (0,1)$, and so, $g_5$ is increasing on $(0,1)$ to

$$g_5(1) = -16\,s\,(1 + s)(1 + 2s)^2\,(1 - p + s + 4s^2 p) < 0.$$

Hence, $g_5 < 0$ on $(0,1)$, so that $g_4$ is decreasing on $(0,1)$ to

$$g_4(1) = 8\,s\,(1 + 2s)^2\,(1 - p + s + 4s^2 p) > 0.$$

Hence, $g_4 > 0$ on $(0,1)$. Since $g_3(1) = g_2(1) = g_1(1) = g(1) = 0$, it follows successively that $g_3 < 0$, $g_2 > 0$, $g_1 < 0$, and $g > 0$ on $(0,1)$. This completes the proof of Lemma 4.1.15. □

Proof of Lemma \4-1.16\ This follows because 63{u) is concave in u, <53(— 1) = 
c^(l — c^"^~^)p > 0, and i53(— c) = S2{—c) > 0, where the latter equality and 
inequality follow immediately from (4.18) and Lemma 4.1.15, respectively. □

Proof of Lemma 4.1.17. It is clear from the second expression for $p_*(m)$ in (4.4) that $p_*(m)$ decreases continuously from $\tfrac12$ to $0$ as $m$ increases from $1$ to $\infty$. Also, one can verify that $m_*(p_*(m)) = m$ for all $m \ge 1$ (here one may use identity

(13113). 

If now $p \in (0, p_*(m))$, then $p = p_*(m_1)$ for some $m_1 > m$, whence $m < m_1 = m_*(p_*(m_1)) = m_*(p)$.

It remains to consider the condition $p \ge p_*(m)$. If at that $p > \tfrac12$, then $m \ge 1 = m_*(p)$, by (2.3). If, however, $p \in [p_*(m), \tfrac12]$, then $p = p_*(m_1)$ for some $m_1 \in [1, m]$, whence $m \ge m_1 = m_*(p_*(m_1)) = m_*(p)$. □
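The inverse relation $m_*(p_*(m)) = m$ used in this proof can be checked numerically. The sketch below rests on two assumptions that are not confirmed within this section: $m_*(p)$ is taken from the abstract as $(1 + p + 2p^2)/(2\sqrt{p - p^2} + 4p^2)$ for $0 < p \le \tfrac12$ and $1$ otherwise, and $p_*(m)$ is taken to be the smaller root paired with $p_{**}(m)$ from the proof of Lemma 4.1.18, that is, $\bigl(2m + 1 - \sqrt{4(m-1)(m+2)+1}\bigr)/\bigl(4(2m-1)\bigr)$.

```python
import math

def m_star(p):
    # m_*(p) as read from the abstract; equals 1 for p >= 1/2
    if p >= 0.5:
        return 1.0
    return (1.0 + p + 2.0 * p * p) / (2.0 * math.sqrt(p - p * p) + 4.0 * p * p)

def p_star(m):
    # assumed closed form of p_*(m) (the original display (4.4) is not in this section)
    root = math.sqrt(4.0 * (m - 1.0) * (m + 2.0) + 1.0)
    return (2.0 * m + 1.0 - root) / (4.0 * (2.0 * m - 1.0))

round_trip_errors = [abs(m_star(p_star(m)) - m) for m in (1.0, 1.5, 2.0, 5.0, 20.0)]
```

Under these assumptions the round trip reproduces $m$ up to rounding, $p_*(1) = \tfrac12$, and, for example, $m_*(0.05) > 2 > m_*(0.1)$, in line with $p_*(2) \approx 0.073$ and the equivalence of $m \ge m_*(p)$ and $p \ge p_*(m)$.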
Proof of Lemma 4.1.18. Suppose, to the contrary, that statement of Theorem is true, while $m < m_*(p)$. Then, by Lemma 4.1.17, one has

$$p < p_*(m);$$

then, in particular, one has $0 < p < \tfrac12$. Introduce

2i-27^(m-l) 



^' (2to-1)(1-p)' 

In view of the elementary inequality ^ l/(2m— 1) for to ^ 1 and the condition 
p < p*, one has 

which implies that 

-2"^ < Up s; 0. 

Taking into account these bounds on p and Up and employing notation intro- 
duced in the proof of Lemma [4. 1.1 31 one can see that 

. ,1 2i-V(2'»)(2m-l)(p-p.)(p-p.,) 
OgAi,p^rn(o,Up)\g^^/i ^ r , where 

' TO (1 — p) 

$$p_{**} := p_{**}(m) := \frac{2m + 1 + \sqrt{4(m - 1)(m + 2) + 1}}{4(2m - 1)} > p_*.$$

Because of the assumption p < p^, it follows that 

(6',Up)|e=7r/4 > 0. 

On the other hand. 



for all real u; this follows from H4.17|l . in view of the fact that the derivatives of 
pogi-i/m Q sini-i/" 9 and cos^/™ 9 + sin^^"' 9 in 9 at 9 = 7r/4 are zero. Hence, 

dgAp^rn{0,tp)\g=^/^>O for tp '.^ -Up ~ 2^-'^ ^^"'^ p. 

Note also that Ap_m(7r/4, i) — for all real t. Therefore, Ap_m(6',ip) < for all 
in a left neighborhood of $\pi/4$. Now (4.16) implies that $\pi/4$ is not a point of maximum in $\theta$ of $\mathsf{E} f_t(\mathrm{BC}_1\cos^{1/m}\theta + \mathrm{BC}_2\sin^{1/m}\theta)$ for $t = t_p$. Hence, in view of the homogeneity argument used in the proof of Lemma 4.1.13, $\pi/4$ is not a point of maximum in $\theta$ of $\mathsf{E} f_t(\mathrm{BS}_1\cos^{1/m}\theta + \mathrm{BS}_2\sin^{1/m}\theta)$ for $t = t_p/\sqrt{pq}$. But, for any $t \in \mathbb{R}$, one has $f_t \in \mathcal{H}^3_+ \subseteq \mathcal{F}^3_+$ (where the set inclusion follows by Lemma 4.1.2). Thus, one obtains a contradiction with the assumed statement
plU) of Theorem 01 □ 



rn— 1 